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Description 
Field of the Invention 

[0001] The present invention relates generally to the dental and biomedical arts. In certain embodiments, the inven- 
tion more particularly relates to methods and compositions for stimulating mammalian odontoblasts and inducing mor- 
phogenesis of mammalian dentine. 

Background of the Invention 

[0002] In mammals, periodontal disease, such gingivitis, can arise from the weakening of periodontal tissue by in- 
fectious agents (e.g., buccal microorganisms), nutritional deficiency (e.g. scurvy), or neoplastic disease (e.g., leukemia 
and lymphoma). Periodontal diseases often are characterized by inflammation, bleeding, tissue recession and/or ul- 
ceration. If not properly treated, periodontal diseases can contribute to tooth loss. For example, gingival lesions can 
arise where bacterial plaque adheres to the tooth/gingiva interface and provokes local inflammation and/or recession 
of the gingiva. In the early stages, gingivitis is associated with tooth sensitivity to perception of pressure and/or tem- 
perature. For example, afflicted teeth may ache upon contact with cold or hot stimuli. If untreated, this progresses to 
severe continual throbbing pain, ultimately associated with infection of the tooth pulp tissue, periodontal ligament, or 
alveolar bone of the tooth socket. More severe complications, e.g., endocarditis, can arise where untreated lesions 
provide buccal microorganisms with a portal of entry into the afflicted individual's bloodstream. Harrison's Principles 
of Internal Medicine, 12th edition, 1991 (Wilson et al. , eds.), pp. 242-243. Current treatments include professional 
cleaning to remove plaque and tartar, use of oral antiseptics, local and/or systemic antibiotic therapies, and/or surgical 
procedures to remove periodontal pockets formed from periodontal tissue lesions and necrosis. Gingivitis thus is treated 
by debridement of lesioned gingiva and the affected tooth or tooth root surface adjoining the lesion site. Treated gingival 
lesions heal through the formation of scar tissue at the lesion site. Where tooth loss is imminent or has already occurred 
as a result of periodontal disease, a prosthetic tooth or removable bridge is substituted for the natural tooth. 

[0003] Dental caries also is generally attributable to the weakening of tooth* tissue by infectious agents or nutritional 
causes. A cavity, or carious lesion, often involves colonization and degradation of mineralized tooth tissue (e.g., enamel 
or dentine) by buccal microorganisms. If untreated, the lesion site expands and can weaken, permeating the mineralized 
tooth wall and placing the tooth pulp tissue at risk of infection. Thus, an untreated carious lesion site also can provide 
buccal microorganisms with a portal of entry into the bloodstream. Conventional treatments for dental caries include 
ablation of lesioned dentine to expose a fresh surface of unaffected residual dentine, followed by sealing and restoration 
with an inert material suitable for dental use, e.g., silver amalgam, composite plastic, gold or porcelain. If infection has 
spread to the pulp tissue, it becomes necessary to extract the tooth or remove the contents of the pulp chamber and 
root canals prior to sealing and reconstruction with inert materials. Both approaches require the construction of per- 
manent dental prostheses, such as bridges or crowns, which can become brittle over time. 

[0004] Previously disclosed are methods and compositions capable of inducing periodontal tissue morphogenesis 
and dentinogenesis in a mammal, including a therapeutically effective concentration of a morphogen (U.S.S.N. 
08/155,343 (published as WO94/06399)). Yet, needs remain for improved treatment of dental caries and periodontal 
disease, including gingivitis. Particular needs remain for improved treatment methods and compositions which mitigate 
loss of teeth and associated tissue, including dentine, gingiva and pulp tissue. Still more particular needs remain for 
improved methods and compositions which allow for the regenerative healing of functional dental tissues following 
resection of carious or periodontal lesions, including dentine tissue, pulp, cementum, periodontal ligament, gingiva and 
the like. 

Summary of the Invention 

[0005] It is an object of this invention to provide means for inhibiting loss of dental tissue in mammals, as well as 
means for inducing regeneration thereof. It is an object of the present invention to provide means for stimulating pro- 
liferation and differentiation of odontoblasts in mammals, particularly primates. It also is an object of the present in- 
vention to provide means for stimulating expression of the odontoblast phenotype, including production of mineralized 
dentine matrix, by mammalian tooth pulp tissue, including primate tooth pulp tissue such as human tooth pulp tissue. 
Another object is to provide means for inhibiting the periodontal tissue damage and tooth loss associated with perio- 
dontal and other gum diseases, including gingivitis. Additional objects include providing means for desensitizing teeth 
to perception of pressure or temperature, as well as for sealing a tooth cavity by inducing formation of reparative dentine 
tissue. These and other objects, along with advantages and features of the invention disclosed herein, will be apparent 
from the description, drawings and claims that follow. 

[0006] The invention is defined in the claims appended hereto. The invention provides methods and compositions 
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for inhibiting periodontal and tooth tissue (collectively, dental tissue) loss in a mammal, particularly a human, including 
regenerating damaged tissue and/or inhibiting additional damage thereto.. The methods and compositions of this in- 
vention can be used to prevent and/or inhibit tooth loss associated with gingivitis and other periodontal diseases. The 
present methods and compositions also can be used to desensitize teeth to perception of pressure and/or temperature, 
5 and pain associated, therewith in dental caries and gingivitis. The invention further provides methods and compositions 
for stimulating morphogenesis of mammalian dentine, including stimulating proliferation and differentiation of odonto- 
blasts. In particular, the invention provides methods and compositions for stimulating expression of the odontoblast 
phenotype, including production of dentine matrix, by tooth pulp tissue in mammals, including primates. The present 
invention can be used to seal a cavity in a mammalian tooth by inducing the formation of reparative dentine. Thus, the 
10 invention reduces the need for tooth extraction or root canal therapy as treatments for dental caries or other dental 
damage in which pulp tissue is placed at risk. 

[0007] The methods and compositions of this invention capitalize on the discovery that certain proteins of eukaryotic 
origin, defined herein as morphogens, induce morphogenesis of functional cells, tissues and organs in higher eukary- 
otes, particularly mammals, including humans. That is, morphogens induce or reinduce the fully integrated develop- 
15 mental cascade of cellular and molecular morphogenetic events that culminate in the formation of fully differentiated, 
functional tissue of a type appropriate to the context or local environment in which morphogenesis is induced, including 
any vascularization, connective tissue formation, ennervation and the like characteristic of the naturally-occurring tis- 
sue. Morphogenesis therefore differs significantly from simple reparative healing processes in which scar tissue (e.g., 
fibrous connective tissue) is formed and fills a lesion, or other defect in differentiated, functional tissue. Further, mor- 
20 phogenesis occurs in a "permissive environment" by which is meant a local environment that does not stifle or suppress 
morphogenesis (e.g., regeneration or regenerative healing). Permissive environments exist, e.g., in embryonic tissue 
or in wounded or diseased tissue, including tissue subjected to surgical intervention. Often, a permissive environment 
comprises a suitable matrix or substratum to which cells undergoing differentiation can anchor. Other components of 
a permissive environment typically include signals, e.g., cell surface markers or extracellular matrix components, that 
25 direct the tissue specificity of differentiation. 

[0008] Generally, morphogens are dimeric proteins that induce morphogenesis of one or more eukaryotic (e.g., mam- 
malian) cells, tissues or organs. Of particular interest herein are morphogens that induce morphogenesis at least of 
mammalian dentine, including formation of reparative dentine at or apposite to a dental or periodontal lesion site in a 
mammalian tooth. Morphogens comprise a pair of polypeptides that, when folded, adopt a configuration sufficient for 
30 the resulting dimeric protein to elicit morphogenetic responses in cells and tissues displaying receptors specific for 
said morphogen. That is, morphogens generally induce all of the following biological functions in a morphogenically 
permissive environment: stimulating proliferation of progenitor cells; stimulating the differentiation of progenitor cells; 
stimulating the proliferation of differentiated cells; and supporting the growth and maintenance of differentiated cells. 
"Progenitor" cells are uncommitted cells that are competent to differentiate into one or more specific types of differen- 
35 tiated cells, depending on their genomic repertoire and the tissue specificity of the permissive environment in which 
morphogenesis is induced. Morphogens further can delay or mitigate the onset of senescence- or quiescence-asso- 
ciated loss of phenotype and/or tissue function. Morphogens still further can stimulate phenotypic expression of differ- 
entiated cells, including expression of metabolic and/or functional, e.g., secretory, properties thereof. In addition, mor- 
phogens can induce redifferentiation of committed cells under appropriate environmental conditions. As noted above, 
40 morphogens that induce proliferation and differentiation at least of mammalian odontoblasts, and/or support the growth, 
maintenance and functional properties of mammalian odontoblasts, including the formation of dentine matrix, are of 
particular interest herein. For purposes of the present invention, an "odontoblast" is any differentiated cell occurring or 
arising in mammalian tooth pulp. tissue, that is competent to produce dentine matrix. 

[0009] In preferred embodiments, the pair of morphogen polypeptides have amino acid sequences each comprising 
45 a sequence that shares a defined relationship with an amino acid sequence of a reference morphogen. Herein, preferred 
morphogen polypeptides share a defined relationship with a sequence present in morphogenically active human OP- 
1, Seq. ID No. 4. However, any one or more of the naturally occurring or biosynthetic sequences disclosed herein 
similarly could be used as a reference sequence. Preferred morphogen polypeptides share a defined relationship with 
at least the C-terminal six cysteine domain of human OP-1, residues 43-139 of Seq. ID No. 4. Preferably, morphogen 
50 polypeptides share a defined relationship with at least the C-terminal seven cysteine domain of human OP-1 , residues 
38-139 of Seq. ID No. 4. That is, preferred morphogen polypeptides in a dimeric protein with morphogenic activity each 
comprise a sequence that corresponds to a reference sequence or is functionally equivalent thereto. 

[0010] Functionally equivalent sequences include functionally equivalent arrangements of cysteine residues dis- 
posed within the reference sequence, including amino acid insertions or deletions which alter the linear arrangement 
55 of these cysteines, but do not materially impair their relationship in the folded structure of the dimeric morphogen 
protein, including their ability to form such intra- or inter-chain disulfide bonds as may be necessary for morphogenic 
activity. Functionally equivalent sequences further include those wherein one or more amino acid residues differs from 
the corresponding residue of a reference morphogen sequence, e.g., the C-terminal seven cysteine domain (also 
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referred to herein as the conserved seven cysteine skeleton) of human OP-1 , provided that this difference does not 
destroy morphogenic activity. Accordingly, conservative substitutions of corresponding amino acids in the reference 
sequence are preferred. Amino acid residues that are "conservative substitutions" for corresponding residues in a 
reference sequence are those that are physically or functionally similar to the corresponding reference residues, e.g., 
5 that have similar size, shape, electric charge, chemical properties including the ability to form covalent or hydrogen 
bonds, or the like. Particularly preferred conservative substitutions are those fulfilling the criteria defined for an "ac- 
cepted point mutation" in Dayhoff et al. (1978), 5 Atlas of Protein Sequence and Structure, Suppl. 3, ch. 22 (pp. 354-352), 
Natl. Biomed. Res. Found., Washington, D.C. 20007. 

[0011] In certain embodiments, a polypeptide suspected of being functionally equivalent to a reference morphogen 
io polypeptide is aligned therewith using the method of Needleman et al. (1 970), 48 J.Mol. Biol. 443-453, implemented 
conveniently by computer programs such as the Align program (DNAstar, Inc). As noted above, internal gaps and 
amino acid insertions in the candidate sequence are ignored for purposes of calculating the defined relationship, con- 
ventionally expressed as a level of amino acid sequence homology or identity, between the candidate and reference 
sequences. “Amino acid sequence homology” is understood herein to mean amino acid sequence similarity. Homolo- 
15 gous sequences share identical or similar amino acid residues, where similar residues are conservative substitutions 
for or "allowed point mutations" of corresponding amino acid residues in an aligned reference sequence. Thus, a 
candidate polypeptide sequence that shares 70% amino acid homology with a reference sequence is one in which any 
70% of the aligned residues are either identical to or are conservative substitutions of the corresponding residues in 
a reference sequence. 

20 [0012] Of particular interest herein are morphogens, which, when provided to the tooth and/or jawbone surfaces in 

a mammalian tooth socket, induce periodontal tissue formation where periodontal tissue has been lost or damaged. 
Of still more particular interest herein are morphogens which, when applied to a tooth surface, such as a dentinal 
surface, induce morphogenesis of new or reparative dentine. Such morphogens can be used to seal a tooth cavity or 
to desensitize a tooth to perception of pressure and/or temperature. 

25 [001 3] The present invention alternatively can be practiced with methods and compositions comprising a morphogen 

stimulating agent in lieu of a morphogen. A "morphogen stimulating agent" is a compound that stimulates in vivo pro- 
duction, e.g., expression, of a therapeutically effective concentration of an endogenous morphogen in the body of the 
mammal sufficient to regenerate damaged dental tissue and/or to inhibit additional damage thereto. Such compounds 
are understood to include substances which, when administered to a mammal, act on cells of tissue(s) or organ(s) that 
30 normally are competent to produce and/or secrete a morphogen encoded within the genome of the mammal, and which 

cause the endogenous level of the morphogen in the mammal's body to be altered. Endogenous or administered 
morphogens can act as endocrine, paracrine or autocrine factors. That is, endogenous morphogens can be synthesized 
by the cells in which morphogenetic responses are induced, by neighboring cells, or by cells of a distant tissue, in 
which circumstances the secreted endogenous morphogen is transported to the site of morphogenesis e.g., by the 
35 individual's bloodstream. In preferred embodiments, the agent stimulates expression and/or secretion of an endog- 
enous morphogen so as to increase amounts thereof in dental tissues, such alveolar bone, periodontium, cementum, 
dentine or pulp tissue cells. 

[0014] In certain preferred aspects of the present invention, the morphogens described herein can induce regener- 
ation of damaged or lost dentine tissue in a mammalian tooth. The morphogen can be provided topically or otherwise 
40 administered to the tooth tissue. For example, the morphogen can be dispersed in a biocompatible, porous carrier 
material that then is provided topically to the damaged dentine tissue. A useful carrier can be formulated from suitable 
organ specific tissue, e.g., bone or dentine, by demineralizing and guanidine-extracting the tissue to create an acellular 
matrix as described in U.S.S.Nos. 07/971,091 (published as W094/10203), 08/155,343 (published as WO94/06399) 
and 08/1 74,605 (published as W094/06420). Synthetic materials also can be used. In some embodiments, the existing 
45 tooth tissue provides a suitable matrix. If a formulated matrix or carrier is used, it should be a biocompatible, suitably 
modified acellular matrix having dimensions such that it allows the differentiation and proliferation of migratory progen- 
itor cells, and contributes to a morphogenically permissive environment. Preferably, the matrix allows cellular attach- 
ment and is biodegradable or bioresorbable. Where the tissue locus to which the morphogen and matrix are applied 
lacks sufficient endogenous signals to direct the tissue specificity of morphogenesis, the matrix preferably further com- 
50 prises tissue-specific components or is derived from tissue of the desired type. Matrices can be generated from dehy- 
drated organ-specific tissue by, e.g., treating the tissue with solvents to substantially remove the cellular, non-structural 
components therefrom. Alternatively, the matrix can be prepared from a biocompatible, in vivo biodegradable structural 
molecule, optionally formulated with suitable tissue-specific cell attachment factors. Thus, collagen, laminin, hyaluronic 
and/or the like, can be used, as can synthetic polymers or copolymers of polylactic acid, polybutyric acid, polyglycolic 
55 acid and the like. Currently preferred structural molecules include tissue-specific collagens. Currently preferred cell 
attachment factors include glycosaminoglycans and proteoglycans. Preferably collagens, glycosaminoglycans and/or 
proteoglyceans are used that are of the same types as those that are naturally found in dental tissues. If needed, the 
matrix can be treated with an agent effective for enhancing porosity thereof, so as to create a scaffold structure suitable 
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for cell influx and attachment. 

[ 0015 ] Alternatively, the morphogen can be applied in association with a carrier that maintains the morphogen sub- 
stantially at the site of application, and/or enhances the controlled delivery of morphogen substantially at the site at 
which morphogenesis is to be induced. Such carriers also are disclosed in U.S.S.Nos. 07/971,091 (published as 
5 W094/1 0203), 08/1 55,343 (published as WO94/06399) and 08/174,605 (published as W094/06420). Useful carriers 

include compositions having a high viscosity, such as that provided by glycerol and the like, as well as carrier materials 
formulated from extracellular matrices and/or which contain laminin, collagen, and/or biocompatible synthetic polymers, 
such as polybutyric, polylactic, polyglycolic acids and copolymers thereof. 

[0016] Accordingly, the present morphogens can be used to stimulate morphogenesis of new or reparative dentine 
10 in a mammalian tooth, including the formation of dentine matrix by mature, differentiated or newly formed odontoblasts, 
i.e., by competent cells of the tooth pulp tissue. That is, the present morphogens can stimulate proliferation, differen- 
tiation and/or phenotypic expression of mammalian cells competent to elaborate dentine matrix, including odontoblasts 
and/or pulp connective tissue ceils. This morphogenetic activity is responsible for the formation of reparative dentine 
in mammalian teeth. Thus, the present morphogens can be used to increase thickness of a mammalian tooth wall; that 
15 is, to increase the thickness of mineralized tissue (dentine, enamel and/or cementum) separating viable tooth pulp 
tissue from the buccal environment. As a result, the present morphogens can be used to reduce the risk of tooth wall 
fracture, particularly at sites where the tooth wall is thin or weakened due to association with a gingival lesion site or 
a cavity. 

[ 0017 ] Thus, the present invention can be used to seal a tooth cavity, up to and including a Stage V cavity, in a 
20 mammalian tooth, particularly a primate tooth such as a human tooth. Carious tissue preferably is ablated from the 
cavity site to expose a fresh surface of residual dentine therein, preferably transverse to luminae of dental canaliculi 
within the tooth. The residual dentine surface preferably is located up to about 1 mm, more preferably up to about 0.5 
mm, still more preferably up to about 0.2 mm from the pulp chamber wall (i.e., from a mature odontoblast layer at the 
dentine/pulp interface). Application of a morphogen to this surface prior to or concurrently with tooth reconstruction, 
25 including filling of the site of the carious lesion with a suitable material, induces formation of reparative dentine matrix 
within the reconstructed tooth. In this manner, risk of fracture in the residual dentine, and subsequent treatment by root 
canal therapy or tooth extraction, can be avoided. 

[ 0018 ] Similarly, the present invention can be used to desensitize mammalian teeth to perception of pressure and/ 
or temperature in an individual afflicted with periodontal disease, e.g., gingivitis. Following debridement of surfaces 
30 within a gingival lesion, including removal of bacterial plaque or tartar, a morphogen is applied to an exposed dentinal 
surface therein, preferably in an amount effective for stimulating formation of reparative dentine apposite said surface. 
Reparative dentine so formed can be within or external to the pulp chamber of the treated tooth, and serves as an 
enhanced protective barrier between the pulp tissue and the buccal environment. Further, morphogen applied to a 
healthy gingival surface adjoining the lesion site promotes gingival regeneration and/or retards gingival recession. 

35 [ 0019 ] In the above-mentioned embodiments, morphogens or morphogen stimulating agents are applied, e.g., top- 

ically or by local injection, to a tooth surface e.g., a dentinal surface. Preferably, the surface is transverse to luminae 
of dental canaliculi within naturally formed tooth dentine, such that fluid microcontact can be established between 
applied morphogen and odontoblasts or pulp tissue present within the tooth. The morphogen can be applied solubilized 
or otherwise dispersed (e.g., as a colloidal suspension or emulsion) in a physiologically compatible liquid vehicle, e. 
40 g., comprising physiological saline solution, or in a vehicle, e.g., comprising ethanol, that evaporates under physiological 

conditions to leave a morphogen residue adsorbed on the tooth surface. Alternatively, the morphogen can be sorbed 
on a matrix such as a biocompatible, acellular matrix suitable for sealing or filling defects in mammalian teeth, e.g., as 
described above. Morphogen-sealed defects can, if desired, be filled or reconstructed to restore original tooth dimen- 
sions using conventional dental reconstruction materials. 

45 [0020] In all such embodiments, the morphogen-treated dentinal surface should be rendered essentially free of buc- 

cal microoganisms, and aseptic conditions should be maintained in the treated locus during the time period in which 
morphogenetic activity is induced. 

[0021 ] Morphogens and morphogen-stimulating agents of the present invention also can be provided to periodontium 
and/or tooth tissues together with other molecules ("cofactors") known to have a beneficial effect in treating damaged 
so dental tissues, particularly cofactors capable of mitigating or alleviating symptoms typically associated with dental 
tissue damage and/or loss. Examples of such cofactors include antiseptics such as chlorohexidine and tibezonium 
iodide, antibiotics, including tetracycline, aminoglycosides, macrolides, penicillins and cephalosporins, anaesthetics 
and analgesics, and other non-steroidal anti-inflammatory agents. 

55 Brief Description of the Drawings 

[0022] The foregoing and other objects, features and advantages of the present invention, as well as the invention 
itself, will be more fully understood from the following description of preferred embodiments, when read together with 
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the accompanying drawings, in which: 

[0023] FIGURE 1 is a schematic illustration of a healthy mammalian tooth in a tooth socket. 

[0024] FIGURE 2, panels 2-1 through 2-12, depicts alignment of sequences of various naturally occurring morpho- 
gens with a preferred reference sequence of human OP-1, residues 38-139 of Seq. ID No. 4. Morphogens shown in 
5 FIGURE 4 also are identified in Table I, above and in the Sequence Listing. 

[0025] FIGURE 3 is a digitized video image of a typical tissue section through a primate tooth treated with morphogen, 
and shows morphogen-induced reparative dentine therein. Bar is 0.5 mm, original magnification 2.5x. 

[0026] FIGURE 4 is a bar graph illustration of results establishing that morphogen stimulation of new or reparative 
dentine formation is dose dependent. In this figure, the dose applied of recombinant human OP-1 is shown in pg on 
10 the X-axis, and the surface area in mm of induced dentine is shown on the Y-axis. 

[0027] FIGURE 5 is a line graph illustration of results establishing that morphogen stimulates new or reparative 
dentine formation under thin bridges of residual natural dentine. In this figure, equivalent amounts (e.g., 10pg) of re- 
combinant human OP-1 were applied to residual dentine bridges of the thicknesses shown along the X-axis. 

[0028] FIGURE 6 is a line graph illustration of results comparing the effects of recombinant human OP-1 to a con- 
15 ventional agent, Ca(OH) 2 , on stimulation of new or reparative dentine under thin bridges of residual dentine. Here as 
well, equivalent amounts (e.g., 1 0|xg) of recombinant human OP-1 or Ca(OH) 2 were applied as indicated to residual 
dentine bridges of the thicknesses shown on the X-axis. 

Detailed Description of Preferred Embodiments 

20 

[0029] It has been discovered that the morphogens described herein can stimulate tissue formation, including mor- 
phogenesis or regeneration of lost or damaged mammalian dental tissue, including dentine. The invention can be used 
to desensitize teeth, retard gingival recession, seal cavities, increase thickness of the tooth wall, and reduce the risk 
of tooth wall fracture. The invention is practiced using a morphogen or morphogen-stimulating agent, as defined herein, 
25 according to the procedures described herein. 

[0030] Provided below is a description of tooth anatomy and useful morphogens, including methods for their produc- 
tion and formulation, as well as exemplary, non-limiting examples which (1) demonstrate the suitability of the morph- 
ogens described herein in the methods of the invention, and (2) provide assays with which to test candidate morphogens 
for their efficacy. 

30 

L Tooth Anatomy 

[0031] A vertical section of a mammalian tooth in the tooth socket is shown schematically in FIGURE 1 . The crown 
6 of the tooth is composed of enamel 8 and dentine 22. The pulp chamber 1 2 is seen in the interior of the crown 6 and 
35 the center of the root 10; it extends downward into the bony area 14, 16, 18 and opens by a minute orifice, the apical 
foramen 20, at the extremity of the root 10. The pulp chamber 12 contains dental pulp, a loose connective tissue richly 
supplied with blood vessels and nerves, entering the chamber through the apical foramen 20. Some of cells of the pulp 
tissue, i.e., odontoblasts, the precursors of dentine 22, are arranged generally as a layer on the wall of the pulp chamber 
12. During development of the tooth, odontoblasts are columnar, but later, after the dentine 22 is fully formed, they 
40 become flattened and resemble osteoblasts. 

[0032] The solid portion or mineralized wall of the mature tooth includes dentine 22, enamel 8, and a thin layer of 
cementum 24, which is disposed on the surface of the root 25. Enamel 8 is formed during development of the tooth 
from amyloblasts, and cementum 24 is formed from cementoblasts. In a fully developed tooth, the principal mass of 
the tooth comprises dentine 22, which is made up of hydroxyapatite crystals embedded in a strong meshwork of collagen 
45 fibers. The dentine includes a number of minute wavy and branching tubes called dental canaliculi, embedded in a 
dense homogeneous substance, the matrix. The dental canaliculi are parallel with one another and open at their inner 
ends into the pulp chamber 12. The dentine matrix is translucent and comprises the majority of the inorganic mass of 
the dentine. It includes a number of fine fibrils, which are continuous with the fibrils of the dental pulp. After the inorganic 
matter has been removed by steeping a tooth in weak acid, the remaining organic matter may be torn into laminae that 
50 run parallel with the pulp chamber 12 across the direction of the tubes. 

[0033] The cementum 24 is disposed as a thin mineralized layer covering the tooth root. It extends from where the 
enamel terminates to the apex of each root, where it is usually very thick. Cementum resembles bone in structure and 
chemical composition in that it contains, sparingly, the lacunae and canaliculi that characterize true bone; in the thicker 
portions of the cementum, the lamellae and Haversian canals peculiar to bone also are found. As a result of aging, the 
55 cementum increases in thickness and the pulp chamber also becomes partially filled with a hard substance that is 
intermediate in structure between dentine and bone (referred to herein as "osteodentine"). It appears to be formed by 
a slow conversion of the dental pulp, which shrinks or even disappears. 

[0034] The periodontal ligament, or periodontal membrane 26, is the layer of periodontal tissue which forms a cushion 
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between the cementum 24 and the bone 1 4, 1 6, 1 8; it holds the tooth in position by suspending it in the socket (alveolus) 
of the jawbone. The periodontal ligament is a highly organized tissue which is formed from periodontal fibroblasts. It 
organizes the collagen fibers which pass directly from the bone of the jaw into the cementum. 

[0035] Thus, as used herein, "tooth" refers to a natural or synthetic composition essentially defining the shape of a 
5 natural mammalian tooth, having a solid tooth body, including a crown and tooth root. "Periodontium" defines the tissues 
which surround the tooth in the tooth socket and includes both periodontal ligament and cementum. "Gingiva" defines 
the dense fibrous tissue, covered by oral mucosa, that envelopes the alveolar bone (tooth socket) processes of the 
upper and lower jaws, as well as the mineralized tooth wall as it emerges from the periodontium. “Viable" tissue means 
living, substantially healthy tissue essentially free of microorganisms and infection associated therewith. In particular, 
10 viable tissue means viable dental tissue such as enamel, dentine, tooth pulp, gingiva, cementum and periodontal 
ligament. "Enhancing viability" of dental tissue means improving the structural and functional integrity of living tissue, 
including improving the clinical status of damaged or diseased tissue. "Viable tooth" refers to an implanted natural 
tooth with a living tooth root. "Inhibit loss" of dental tissue, as used herein, means inhibiting damage to, and/or loss of, 
dental tissue and includes regenerating lost, damaged or diseased tissue and/or inhibiting additional damage thereto. 
15 [0036] "Residual dentine" means naturally formed, healthy dentine tissue, e.g., adjoining a carious or gingival lesion, 

particularly a lesion from which infected dentine has been ablated and/or bacterial plaque or tartar has been debrided. 
Naturally formed dentine tissue comprises tubules, the dental canaliculi, extending generally radially through the den- 
tine from the layer of odontoblasts lining the pulp chamber wall (described above in connection with FIGURE 1 ). Thus, 
a dentinal surface "transverse to the lumina of dental canaliculi" is a dentine surface disposed on any plane that inter- 
20 sects rather than parallels the lumina of one or more dental canaliculi. A "dentinal" surface can define a natural boundary 

of naturally formed dentine, or a fresh surface of dentine exposed by drilling or other dental techniques, or by fracture 
or chipping of the tooth wall. A treatment or stimulation "apposite" to a dentinal surface means a treatment or stimulation 
in juxtaposition or close proximity to the dentinal surface (e.g., separated from said surface by up to about a 1mm 
thickness of intervening tissue such as residual dentine). "Reparative dentine" comprises atubular dentine matrix elab- 
25 orated by mature or proliferating odontoblasts or other competent cells of the pulp connective tissue, and can be formed 
within the pulp chamber of a mammalian tooth. 

[0037] "Symptom alleviating cofactor" refers to one or more conventional pharmaceuticals which can, if desired, be 
included in compositions of this invention and which alleviate or mitigate one or more of the symptoms typically asso- 
ciated with loss of or damage to dental tissue. Exemplary cofactors include antibiotics, antiseptics, non-steroidal anti- 
30 inflammatory agents, anaesthetics and analgesics. 

II. Useful Morphogens 

[0038] Morphogens useful in this invention include eukaryotic proteins originally identified as osteogenic proteins 
35 (see U.S. Patent 5,011,691,), such as the OP-1, OP-2, OP-3 and CBMP2 proteins (Seq. ID Nos. 4-9, 15-22, 25 and 
26), as well as amino acid sequence-related proteins such as DPP (Seq. ID No. 10, from Drosophila), Vgl (Seq. ID No. 
11, from Xenopus), Vgr-1 (Seq. ID No. 12, from mouse), GDF-1 (Seq. ID Nos. 13, 30 and 31, from humans, see Lee 
(1991), 88 PNAS 4250-4254), 60A (Seq. ID Nos. 23 and 24, from Drosophila, see Wharton et al. (1991), 88 PNAS 
9214-9218), dorsalin-1 (from chick, see Basler et al. (1993), 73 Cell 687-702 and GenBank accession number LI 2032) 
4 0 and GDF-5 (from mouse, see Storm et al. (1994), 368 Nature 639-643). Additional useful morphogens include biosyn- 

thetic morphogen constructs disclosed in U.S. Pat. No. 5,011,691, e.g., COP-1 ,3-5, 7 and 16. See also U.S. Pat. No. 
4,968,590. 

[0039] Naturally occurring proteins identified and/or appreciated herein to be morphogens form a distinct subgroup 
within the loose evolutionary grouping of sequence-related proteins known as the TGFp superfamily or supergene 
45 family. The naturally occurring morphogens share substantial amino acid sequence homology in their C-terminal re- 
gions (domains). Typically, the above-mentioned naturally occurring morphogens are translated as a precursor, having 
an N-terminal signal peptide sequence, typically less than about 30 residues, followed by a "pro" domain that is cleaved 
to yield the mature C-terminal domain. The signal peptide is cleaved rapidly upon translation, at a cleavage site that 
can be predicted in a given sequence using the method of Von Heijne (1986), 14 Nucleic Acids Research 4683-4691 . 
50 The pro domain typically is about three times larger than the fully processed mature C-terminal domain. Herein, the 
"pro" form of a morphogen refers to a morphogen comprising a folded pair of polypeptides each comprising the pro 
and mature domains of a morphogen polypeptide. Typically, the pro form of a morphogen is more soluble than the 
mature form under physiological conditions. The pro form appears to be the primary form secreted from cultured mam- 
malian cells. 

55 [0040] Table I, below, summarizes various naturally occurring morphogens identified to date, including their nomen- 

clature as used herein, their Seq. ID references, and publication sources for the amino acid sequences for the full 
length proteins not included in the Seq. Listing. Each of the generic terms set forth in Table I is intended and should 
be understood to embrace morphogenically active proteins expressed from nucleic acids encoding the identified se- 
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quence mentioned below and set forth in the sequence listing, or a morphogenically active fragment or precursor 
thereof, including functional equivalents thereof such as naturally occurring and biosynthetic variants thereof. Naturally 
occurring variants thereof include allelic variant forms isolated from other individuals of a single biological species, and 
phylogenetic counterpart (species) variant forms isolated from phylogenetically distinct biological species. 



TABLE I 



"OP-1” 


Refers generically to morphogenically active proteins expressed from nucleic acid encoding the 
human OP-1 protein disclosed in Seq. ID No. 4 ("hOP-1 "), and includes at least mouse OP-1, Seq. 
ID No. 5 ( H mOP-1 "). In each of human and mouse OP-1 , Seq. ID Nos. 4 and 5, the conserved seven 
cysteine skeleton is defined by residues 38 to 139. cDNA sequences and amino acid sequences 
encoded therein and corresponding to the full length proteins are provided in Seq. ID Nos. 15 and 
1 6 (hOPI ) and Seq. ID Nos. 1 7 and 1 8 (mOPI .) The mature proteins are defined by residues 293-431 
(hOPI) and 292-430 (mOPI). The "pro” regions of the proteins, cleaved to yield the mature, 
morphogenically active proteins are defined essentially by residues 30-292 (hOPI ) and residues 
30-291 (mOPI). 


"OP-2" 


Refers generically to morphogenically active proteins expressed from a nucleic acid encoding the 
human OP-2 protein disclosed in Seq. ID No. 6 ( M hOP-2“), and includes at least mouse OP-2 ("mOP- 
2", Seq. ID No. 7). In each of human and mouse OP-2, the conserved seven cysteine skeleton is 
defined by residues 38 to 1 39 of Seq. ID Nos. 6 and 7. cDNA sequences and amino acid sequences 
encoded therein and corresponding to the full length proteins are provided in Seq. ID Nos. 19 and 
20 (hOP2) and Seq. ID Nos. 21 and 22 (mOP2.) The mature proteins are defined essentially by 
residues 264-402 (hOP2) and 261-399 (mOP2). The "pro" regions of the proteins, cleaved to yield 
the mature, morphogenically active proteins are defined essentially by residues 18-263 (hOP2) and 
residues 18-260 (mOPI). 


“OP-3" 


Refers generically to morphogenically active proteins expressed from a nucleic acid encoding the 
mouse OP-3 protein disclosed in Seq. ID No. 26 ("mOP-3"). The conserved seven cysteine domain 
is defined by residues 298 to 399 of Seq. ID No. 26, which shares greater than 79% amino acid 
identity with the corresponding mOP-2 and hOP-2 sequences, and greater than 66% identity with 
the corresponding OP-1 sequences. A cDNA sequence encoding the above-mentioned amino acid 
sequence is provided in Seq. ID No. 25. OP-3 is unique among the morphogens identified to date 
in that the residue at position 9 in the conserved seven cysteine domain (e.g., residue 315 of Seq. 
ID No. 26) is a serine, whereas other morphogens typically have a tryptophan at this location. 


"CBMP2" 


Refers generically to morphogenically active proteins expressed from a nucleic acid encoding the 
CBMP2 proteins, including at least human CBMP2A ("CBMP2A(fx) n , Seq ID No. 8) and human 
CBMP2B ( ,, CBMP2B(fx)", Seq. ID No. 9). The amino acid sequence for the full length proteins, 
referred to in the literature as BMP2A and BMP2B, or BMP2 and BMP4, appear in Wozney, et al. 
(1988), 242 Science 1 528-1 534. The pro domain for BMP2 (BMP2A) likelv includes residues 25-248: 
the. mature protein, residues 249-396. The pro domain for BMP4 (BMP2B) likely includes residues 
25-256; the mature protein, residues 257-408. 


"DPP(fx)" 


. 

refers to proteins encoded by the Drosophila DPP gene and defining the conserved seven cysteine 
skeleton (Seq. ID No. 10). The amino acid sequence for the full length protein appears in Padgett, 
et al (1987), 325 Nature 81-84. The pro domain likelv extends from the siqnal peptide cleavaae site 
to residue 456; the mature protein likely is defined by residues 457-588. 


*'Vgl(fx) M 


refers to proteins encoded by the Xenopus Vgl gene and defining the conserved seven cysteine 
skeleton (Seq. ID No. 11). The amino acid sequence for the full length protein appears in Weeks 
(1987), 51 Cell 861-867. The prodomain likely extends from the signal peptide cleavage site to 
residue 246; the mature protein likely is defined by residues 247-360. 
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TABLE I (continued) 





"Vgr-I(fx)" 


refers to proteins encoded by the murine Vgr-1 gene and defining the conserved seven cysteine 






skeleton (Seq. ID No. 1 2). The amino acid sequence for the full length protein appears in Lyons, et 
al, (1 989), 86 PNAS 4554-4558. The prodomain likely extends from the signal peptide cleavaqe site 


5 




to residue 299; the mature protein likely is defined by residues 300-438. 




"GDF-1 (fx)" 


refers to proteins encoded by the human GDF-1 gene and defining the conserved seven cysteine 
skeleton (Seq. ID No. 13). The cDNA and encoded amino sequence for the full length protein are 


10 




provided in Seq. ID. Nos. 30 and 31 . The prodomain likely extends from the signal peptide cleavage 
site to residue 214; the mature protein likely is defined by residues 21 5-372. 




"60A" 


refers generically to morphogenically active proteins expressed from nucleic acid (e.g., the 
Drosophila 60A gene) encoding 60A protein or morphogenically active fragments thereof (see Seq. 


15 




ID Nos. 23 and 24 wherein the cDNA and encoded amino acid sequence for the full length protein 
are provided). "60A(fx)" refers to the protein sequences defining the conserved seven cysteine 
skeleton (residues 354 to 455 of Seq. ID No. 24.) The prodomain likely extends from the signal 






peptide cleavage site to residue 324; the mature protein likely is defined by residues 325-455. The 
60A protein is considered likely herein to be a phylogenetic counterpart variant of the human and 


20 




mouse OP-1 genes; Sampath et al. (1993), 90 PNAS 6004-6008. 




"BMP3(fx)° 


refers to proteins encoded by the human BMP3 gene and defining the conserved seven cysteine 
skeleton (Seq. ID No. 26). The amino acid sequence for the full length protein appears in Wozney 
et al. (1 988), 242 Science 1 528-1 534. The pro domain likely extends from the signal peptide cleavaqe 


25 




site to residue 290; the mature protein likely is defined by residues 291-472. 




"BMP5(fx)° 


refers to proteins encoded by the human BMP5 gene and defining the conserved seven cysteine 
skeleton (Seq. ID No. 27). The amino acid sequence for the full length protein appears in Celeste, 
et al. (1991), 87 PNAS 9843-9847. The pro domain likely extends from the siqna! peptide cleavaqe 


30 




site to residue 31 6; the mature protein likely is defined by residues 31 7-454. 


35 


"BMP6(fx)° 


refers to proteins encoded by the human BMP6 gene and defining the conserved seven cysteine 
skeleton (Seq. ID No. 28). The amino acid sequence for the full length protein appears in Celeste, 
et al. (1 990), 87 PNAS 9843-5847. The pro domain likely includes extends from the signal peptide 




cleavage site to residue 374; the mature sequence likely includes residues 375-513. 



[0041] As shown in FIGURE 2, the OP-2 and OP-3 proteins have an additional cysteine residue in the conserved 
C-terminal region (e.g., see residue 41 of Seq. ID Nos. 6 and 7), in addition to the conserved cysteine skeleton or 
domain in common with the other known proteins in this family. The GDF-1 protein has a four amino acid insert within 
the conserved skeleton (residues 44-47 of Seq. ID No. 13) but this insert likely does not interfere with the relationship 
of the cysteines in the folded structure. Further, the CBMP2 proteins are missing one amino acid residue within the 
cysteine skeleton. Thus, these morphogen polypeptides illustrate principles of alignment used herein with respect to 
the preferred reference morphogen sequence of human OP-1 , residues 38-139 of Seq. ID No. 4. 

[0042] In certain preferred embodiments, morphogens useful herein include those in which the amino acid sequences 
of morphogen polypeptides comprise a sequence sharing at least 70% amino acid sequence homology or "similarity", 
and preferably 80% homology or similarity with a reference morphogen selected from the foregoing naturally occurring 
morphogens. Preferably, the reference morphogen is human OP-1, and the reference sequence thereof is the C- 
terminal seven cysteine domain present in morphogenically active forms of human OP-1, residues 38-139 of Seq. ID 
No. 4. Morphogens useful herein accordingly include allelic, phylogenetic counterpart and other variants of the preferred 
reference sequence, whether naturally-occurring or biosyntheticaily produced (e.g., including "muteins" or "mutant 
proteins"), as well as novel members of the morphogenic family of proteins including the morphogens set forth and 
identified above, e.g., in Table I. Certain particularly preferred morphogen polypeptides share at least 60% amino acid 
identity with the preferred reference sequence of human OP-1, still more preferably at least 65% amino acid identity 
therewith. 

[0043] In other preferred embodiments, the family of morphogen polypeptides useful in the present invention, and 
members thereof, are defined by a generic amino acid sequence. For example, Generic Sequence 7 (Seq. ID No. 1) 
and Generic Sequence 8 (Seq. ID No. 2) disclosed below, accommodate the homologies shared among preferred 
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morphogen protein family members identified to date, including at least OP-1 , OP-2, OP-3, CBMP2A, CBMP2B, BMP3, 
60A, DPP, Vgl, BMPS, BMP6, Vgr-1, and GDF-1 (Seq. ID Nos. 4-15, 24, and 26-29). The amino acid sequences for 
these proteins are described herein (see Sequence Listing) and/or in the art, as summarized above. The generic 
sequences include both the amino acid identity shared by these sequences in the C-terminal domain, defined by the 
5 six and seven cysteine skeletons (Generic Sequences 7 and 8, respectively), as well as alternative residues for the 
variable positions within the sequence. The generic sequences provide an appropriate cysteine skeleton where inter- 
or intramolecular disulfide bonds can form, and contain certain critical amino acids likely to influence the tertiary struc- 
ture of the folded proteins. In addition, the generic sequences allow for an additional cysteine at position 41 (Generic 
Sequence 7) or position 46 (Generic Sequence 8), thereby encompassing the morphogenically active sequences of 
io OP-2 and OP-3. 
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Generic Sequence 7 









Leu 


Xaa 


Xaa 


Xaa 


Phe 


Xaa 


Xaa 








1 








5 






Xaa 


Gly 


Trp 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Pro 






10 










15 






Xaa 


Xaa 


Xaa 


Xaa 


Ala 


Xaa 


Tyr 


Cys 


Xaa 


Gly 






20 
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Xaa 


Cys 


Xaa 


Xaa 


Pro 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 
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Xaa 


Xaa 


Xaa 


Asn 


His 


Ala 


Xaa 


Xaa 


Xaa 


Xaa 
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Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 






50 
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Xaa 


Xaa 


Xaa 


Cys 


Cys 


Xaa 


Pro 


Xaa 


Xaa 


Xaa 






60 










65 






Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Leu 


Xaa 


Xaa 


Xaa 


Xaa 






70 










75 






Xaa 


Xaa 


Xaa 


Val 


Xaa 


Leu 


Xaa 


Xaa 


Xaa 


Xaa 






80 










85 






Xaa 


Met 


Xaa 


Val 


Xaa 


Xaa 


Cys 


Xaa 


Cys 


Xaa 



90 95 

wherein each Xaa independently is selected from a group of one or more specified amino acids defined as follows: 
"Res." means "residue" and Xaa at res.2 = (Tyr or Lys); Xaa at res.3 = Val or lie); Xaa at res.4 = (Ser, Asp or Glu); Xaa 
at res.6 = (Arg, Gin, Ser, Lys or Ala); Xaa at res.7 = (Asp or Glu); Xaa at res.8 = (Leu, Val or lie); Xaa at res. 11 = (Gin, 
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Leu, Asp, His, Asn or Ser); Xaa at res.12 = (Asp, Arg, Asn or Glu); Xaa at res. 13 = (Trp or Ser); Xaa at res.14 = (lie 
or Val); Xaa at res. 15 = (lie or Val); Xaa at res. 16 (Ala or Ser); Xaa at res. 18 = (Glu, Gin, Leu, Lys, Pro or Arg); Xaa at 
res. 19 = (Gly or Ser); Xaa at res.20 = (Tyr or Phe); Xaa at res.21 = (Ala, Ser, Asp, Met, His, Gin, Leu or Gly); Xaa at 
res. 23 = (Tyr, Asn or Phe); Xaa at res.26 = (Glu, His, Tyr, Asp, Gin, Ala or Ser); Xaa at res.28 = (Glu, Lys, Asp, Gin or 
5 Ala); Xaa at res. 30 = (Ala, Ser, Pro, Gin, lie or Asn); Xaa at res.31 = (Phe, Leu or Tyr); Xaa at res.33 = (Leu, Val or 
Met); Xaa at res. 34 = (Asn, Asp, Ala, Thr or Pro); Xaa at res.35 = (Ser, Asp, Glu, Leu, Ala or Lys); Xaa at res. 36 = (Tyr, 
Cys, His, Ser or lie); Xaa at res.37 = (Met, Phe, Gly or Leu); Xaa at res.38 = (Asn, Ser or Lys); Xaa at res.39 = (Ala, 
Ser, Gly or Pro); Xaa at res.40 = (Thr, Leu or Ser); Xaa at res.44 = (He, Val or Thr); Xaa at.res.45 = (Val, Leu, Met or 
lie); Xaa at res.46 = (Gin or Arg); Xaa at res.47 = (Thr, Ala or Ser); Xaa at res.48 = (Leu or lie); Xaa at res.49 = (Val 
io or Met); Xaa at res.50 = (His, Asn or Arg); Xaa at res.51 = (Phe, Leu, Asn, Ser, Ala or Val); Xaa at res.52 = (lie, Met, 

Asn, Ala, Val, Gly or Leu); Xaa at res.53 = (Asn, Lys, Ala, Glu, Gly or Phe); Xaa at res.54 = (Pro, Ser or Val); Xaa at 
res. 55 = (Glu, Asp, Asn, Gly, Val, Pro or Lys); Xaa at res.56 = (Thr, Ala, Val, Lys, Asp, Tyr, Ser, Gly, lie or His); Xaa at 
res.57 = (Val, Ala or lie); Xaa at res.58 = (Pro or Asp); Xaa at res.59 = (Lys, Leu or Glu); Xaa at res.60 = (Pro, Val or 
Ala); Xaa at res.63 = (Ala or Val); Xaa at res.65 = (Thr, Ala or Glu); Xaa at res.66 = (Gin, Lys, Arg or Glu); Xaa at res. 
is 67 = (Leu, Met or Val); Xaa at res.68 = (Asn, Ser, Asp or Gly); Xaa at res.69 = (Ala, Pro or Ser); Xaa at res.70 = (lie, 

Thr, Val or Leu); Xaa at res.71 = (Ser, Ala or Pro); Xaa at res.72 = (Val, Leu, Met or lie); Xaa at res.74 = (Tyr or Phe); 
Xaa at res.75 = (Phe, Tyr, Leu or His); Xaa at res.76 = (Asp, Asn or Leu); Xaa at res.77 = (Asp, Glu, Asn, Arg or Ser); 
Xaa at res.78 = (Ser, Gin, Asn, Tyr or Asp); Xaa at res.79 = (Ser, Asn, Asp, Glu or Lys); Xaa at res.80 = (Asn, Thr or 
Lys); Xaa at res.82 = (He, Vat or Asn); Xaa at res.84 = (Lys or Arg); Xaa at res.85 = (Lys, Asn, Gin, His, Arg or Val); 
20 Xaa at res.86 = (Tyr, Glu or His); Xaa at res.87 = (Arg, Gin, Glu or Pro); Xaa at res.88 = (Asn, Glu, Trp or Asp); Xaa 

at res.90 = (Val, Thr, Ala or lie); Xaa at res.92 = (Arg, Lys, Val, Asp, Gin or Glu); Xaa at res.93 = (Ala, Gly, Glu or Ser); 
Xaa at res.95 = (Gly or Ala) and Xaa at res.97 = (His or Arg). 

[0044] Generic Sequence 8 (Seq. ID No. 2) includes all of Generic Sequence 7 and in addition includes the following 
sequence (Seq. ID No. 14) at its N-terminus: 

25 . 

Cys Xaa Xaa Xaa Xaa 
1 5 

30 

Accordingly, beginning with residue 7, each "Xaa" in Generic Seq. 8 is a specified amino acid defined as for Generic 
Seq. 7, with the distinction that each residue number described for Generic Sequence 7 is shifted by five in Generic 
Seq. 8. Thus, “Xaa at res.2 =(Tyr or Lys)" in Gen. Seq. 7 refers to Xaa at res. 7 in Generic Seq. 8. In Generic Seq. 8, 
35 Xaa at res.2 = (Lys, Arg, Ala or Gin); Xaa at res.3 = (Lys, Arg or Met); Xaa at res.4 = (His, Arg or Gin); and Xaa at res. 

5 = (Glu, Ser, His, Gly, Arg, Pro, Thr, or Tyr). 

[0045] As noted above, certain currently preferred morphogen polypeptide sequences useful in this invention have 
greater than 60% identity, preferably greater than 65% identity, with the amino acid sequence defining the preferred 
reference sequence of hOP-1 . These particularly preferred sequences include allelic and phylogenetic counterpart 
*0 variants of the OP-1 and OP-2 proteins, including the Drosophila 60A protein. Accordingly, in certain particularly pre- 
ferred embodiments, useful morphogens include active proteins comprising pairs of polypeptide chains within the ge- 
neric amino acid sequence herein referred to as "OPX" (Seq. ID No. 3), which defines the seven cysteine skeleton and 
accommodates the homologies between several identified variants of OP1 and OP2. As described therein, each Xaa 
at a given position independently is selected from the residues occurring at the corresponding position in the C-terminal 
45 sequence of mouse or human OP1 or OP2 (see Seq. ID Nos. 4-7 and/or Seq. ID Nos. 15-22). 

[0046] In still other preferred embodiments, useful morphogen polypeptides have amino acid sequences comprising 
a sequence encoded by nucleic acid that hybridizes, under stringent hybridization conditions, to DNA or RNA encoding 
reference morphogen sequences, e.g., C-terminal sequences defining the conserved seven cysteine domains of OP1 
or OP2, e.g., nucleotides 1036-1341 and nucleotides 1390-1 695 of Seq. ID No. 15 and 19, respectively. As used herein, 
so stringent hybridization conditions are defined as hybridization according to known techniques in 40% formamide, 5 X 
SSPE, 5 X Denhardt's Solution, and 0.1% SDS at 37°C overnight, and washing in 0.1 X SSPE, 0.1% SDS at 50°C. 
[0047] As noted above, morphogens useful in the present invention generally are dimeric proteins comprising a 
folded pair of the above polypeptides. Morphogens are inactive when reduced, but are active as oxidized homodimers 
and when oxidized in combination with other morphogens of this invention to produce heterodimers. Thus, members 
55 of a folded pair of morphogen polypeptides in a morphogenically active protein can be selected independently from 
any of the specific morphogen polypeptides mentioned above. 

[0048] The morphogens useful in the methods, compositions and devices of this invention include proteins compris- 
ing any of the polypeptide chains described above, whether isolated from naturally-occurring sources, or produced by 
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recombinant DNA or other synthetic techniques, and includes allelic and phylogenetic counterpart variants of these 
proteins, as well as biosynthetic variants (muteins) thereof, and various truncated and fusion constructs. Deletion or 
addition mutants also are envisioned to be active, including those which may alter the conserved C-terminal six or 
seven cysteine domain, provided that the alteration does not functionally disrupt the relationship of these cysteines in 
5 the folded structure. Accordingly, such active forms are considered the equivalent of the specifically described con- 
structs disclosed herein. The proteins may include forms having varying glycosylation patterns, varying N-termini, a 
family of related proteins having regions of amino acid sequence homology, and active truncated or mutated forms of 
native or biosynthetic proteins, produced by expression of recombinant DNA in host cells. 

[0049] The morphogenic proteins can be expressed from intact or truncated cDNA or from synthetic DNAs in pro- 
10 caryotic or eucaryotic host cells, and purified, cleaved, refolded, and dimerized to form morphogenically active com- 
positions. Currently preferred host cells include E coli or mammalian cells, such as CHO, COS or BSC cells. A detailed 
description of the morphogens useful in the methods, compositions and devices of this invention is disclosed in Co.- 
pending U.S. Serial Nos. 07/752,764 (published as W092/1 5323), filed August 30, 1991, and 07/667,724 (abandoned 
in favor of 07/752,764), filed March 11, 1991. 

is [0050] Thus, in view of this disclosure, skilled genetic engineers can isolate genes from cDNA or genomic libraries 
of various different biological species, which encode appropriate amino acid sequences, or construct DNAs from oli- 
gonucleotides, and then can express them in various types of host cells, including both procaryotes and eucaryotes, 
to produce large quantities of active proteins capable of stimulating the morphogenesis of, and/or inhibiting damage 
to, mammalian dental tissues. 

20 [0051 ] As noted above, a protein is morphogenic herein generally if it induces the developmental cascade of cellular 

and molecular events that culminate in the formation of new, organ-specific tissue. Preferably, a morphogen comprises 
a pair of polypeptides having a sequence that corresponds to or is functionally equivalent to at least the conserved C- 
terminal six or seven cysteine skeleton of human OP-1, included in Seq. ID No. 4. The morphogens generally are 
competent to induce all of the following biological functions in a morphogenically permissive environment: stimulating 
25 proliferation of progenitor cells; stimulating the differentiation of progenitor cells; stimulating the proliferation of differ- 
entiated cells; and supporting the growth and maintenance of differentiated cells. Details of how the morphogens useful 
in this invention first were identified, as well as a description on how to make, use and test them for morphogenic 
activity are disclosed in U.S. S. Nos. 07/752,764 (published as W092/15323) and 07/667,274 (abandoned in favor of 
07/752,764). As disclosed therein, the morphogens can be purified from naturally-sourced material or recombinantly 
30 produced from procaryotic or eucaryotic host cells, using the genetic sequences disclosed therein. Alternatively, novel 
morphogenic sequences can be identified following the procedures disclosed therein. 

[ 0052 ] Exemplary useful morphogens include naturally derived proteins comprising a pair of polypeptides, the amino 
acid sequences of which comprise one or more of the sequences disclosed in the Sequence Listing and FIGURE 2. 
Other useful sequences include those of the naturally derived morphogens dorsalin-1 and GDF-5, discussed herein 
35 in connection with Table I, as well as biosynthetic constructs disclosed in U.S. Pat. 5,011,691 (e.g., COP-1 , COP-3, 
COP-4, COP-5, COP-7, and COP-16). 

[ 0053 ] Accordingly, certain preferred morphogens useful in the methods and compositions of this invention can be 
described as morphogenically active proteins having amino acid sequences sharing 70% or, preferably, 80% homology 
(similarity) with a reference morphogen sequence described above, e.g., residues 38-139 of Seq. ID No. 4, where 
40 "homology" is as defined herein above.. Alternatively, in other preferred embodiments, morphogens useful in the meth- 
ods and compositions disclosed herein fall within the family of polypeptides described by Generic Sequence 7, Seq. 
ID No. 1 , more preferably by Generic Sequence 8, Seq. ID No. 2 . 

[ 0054 ] FIGURE 2 herein sets forth an alignment of the amino acid sequences of the active regions of naturally oc- 
curring proteins that have been identified or appreciated herein as morphogens, including human OP-1 (hOP-1, Seq. 
45 ID Nos. 4 and 15-16), mouse OP-1 (mOP-1, Seq. ID Nos. 5 and 17-18), human and mouse OP-2 (Seq. ID Nos. 6, 7, 

. and 19-22), mouse OP-3 (Seq. ID Nos. 25-26), CBMP2A (Seq. ID No. 8), CBMP2B (Seq. ID No. 9), BMP3 (Seq. ID 
No. 27), DPP (from Drosophila, Seq. ID No. 10), Vgl, (from Xenopus, Seq. ID No. 11), Vgr-1 (from mouse, Seq. ID 
No. 12), GDF-1 (from mouse and/or human, Seq. ID Nos. 13, 30 and 31), 60A protein (from Drosophila, Seq. ID Nos. 
23 and 24), BMP5 (Seq. ID No. 28) and BMP6 (Seq. ID No. 29). The sequences are aligned essentially following the 
so method of Needleman et al. (1970), 48 J. Mol. Biol., 443-453, calculated using the Align Program (DNAstar, Inc). In 

FIGURE 2, three dots indicates that the amino acid in that position is the same as the corresponding amino acid in 
hOP-1 . Three dashes indicates that no amino acid is present in that position, and are included for purposes of illustrating 
homologies. For example, amino acid residue 60 of CBMP-2A and CBMP-2B is “missing". Of course, both of these 
amino acid sequences in this region comprise Asn-Ser (residues 58, 59), with CBMP-2A then comprising Lys and lie, 
55 whereas CBMP-2B comprises Ser and lie. FIGURE 2 also illustrates the handling of insertions in the morphogen amino 
acid sequence: between residues 56 and 57 of BMP3 is an inserted Val residue; between residues 43 and 44 of GDF- 
1 is inserted the amino acid sequence, Gly-Gly-Pro-Pro. Such deviations from the reference morphogen sequence are 
ignored for purposes of calculating the defined relationship between, e.g., GDF-1 and hOP-1 . As is apparent from the 
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amino acid sequence comparisons set forth in FIGURE 4, significant amino acid changes can be made from the ref- 
erence sequence while retaining morphogenic activity. For example, while the GDF-1 protein sequence depicted in 
FIGURE 4 shares only about 50% amino acid identity with the hOPI sequence described therein, the GDF-t sequence 
shares greater than 70% amino acid sequence homology (or "similarity") with the hOPI sequence, where "homology" 
5 or "similarity" includes allowed conservative amino acid substitutions within the aligned sequence, e.g., as defined by 
Dayhoffeta!., (f 979) 5 Atlas of Protein Sequence and Structure Suppl. 3, pp.345-362, (M.O. Dayhoff, ed., Nat'f BioMed. 
Res. Fd’n, Washington D.C.). 

[ 0055 ] The currently most preferred protein sequences useful as morphogens in this invention include those having 
greater than 60% identity, preferably greater than 65% identity, with the amino acid sequence defining the conserved 
10 six or seven cysteine skeleton of hOPI (e.g., residues 43-139 or 38-139 of Seq. ID No: 5). These most preferred 
sequences include both allelic and phylogenetic counterpart variants of the OP-1 and OP-2 proteins, including the 
Drosophila 60A protein. Accordingly, in still another preferred aspect, the invention includes morphogens comprising 
species of polypeptide chains having the generic amino acid sequence referred to herein as "OPX", which defines the 
seven cysteine domain and accommodates the identities and homologies between the various identified OP1 and OP2 
15 proteins. OPX is presented in Seq. ID No. 3. As described therein, each Xaa at a given position independently is 
selected from the residues occurring at the corresponding position in the C-terminal sequence of mouse or human 
OP1 or OP2 (see FIGURE 2 and Seq. ID Nos. 4-7 and/or Seq. ID Nos. 15-22). 

[ 0056 ] Alternatively, an effective amount of an agent competent to stimulate endogenous morphogen levels in a 
mammal may be administered by any of the routes described herein. For example, an agent competent to stimulate 
20 morphogen production and/or secretion from periodontal tissue, gingiva, alveolar bone tissue in the tooth socket, or 
pulp tissue, may be provided to a mammal, e.g., by direct administration of the morphogen-stimulating agent to dental 
tissue. Alternatively, the morphogen-stimulating agent may induce morphogen expression and/or secretion at a distant 
site (e.g., at a tissue locus other than dental tissue), with the expressed morphogen circulating to dental tissue com- 
petent to take up the morphogen and respond thereto. A method for identifying and testing agents competent to mod- 
25 ulate the levels of endogenous morphogens in a given tissue is described in detail in prior related U.S^S.Nos. 07/938,021 

(published as WO93/05172) and 07/752,859 (published as WO93/05751 . Briefly, candidate compounds can be iden- 
tified and tested by incubation in vitro with a test tissue or cells thereof, or a cultured cell line derived therefrom, for a 
time sufficient to allow the compound to affect the production, i.e., the expression and/or secretion, of a morphogen 
produced by the cells of that tissue. Here, suitable tissue, or cultured cells of a suitable tissue, preferably can be 
30 selected from renal epithelium, dental fibroblasts, cementoblasts, odontoblasts and osteoblasts. 

III. Formulations and Methods for Administration 

[ 0057 ] The morphogens can be provided to a dental tissue surface, e.g. , a dentinal or gingival surface, by any suitable 
35 means. Preferably, the morphogen, or a morphogen-stimulating agent, is provided directly to the tissue surface by 
topical administration. Alternatively, the morphogen can be provided to the tissue by, for example, local injection. While 
not currently preferred, systemic injection also may be a viable administration route under certain circumstances, such 
as to treat advanced or chronic disease states, or as a preventive measure in individuals at extreme risk of disease. 
A detailed description of considerations for systemic administration, including oral and parenteral administration, is 
disclosed, for example in U.S.S.No. 08/165,511. (published as WO93/04692). 

[ 0058 ] Where the morphogen is provided directly to a dentinal surface, it can be administered as part of a biocom- 
patible formulation that may be a liquid, gel or solid. For example, it can be dispersed in an aqueous medium that does 
not impair the mammal's physiologic fluid or salt balance. The aqueous medium for the morphogen thus may comprise 
normal physiologic saline (0.85% or 0.15 M NaCI), pH 7.0-7. 4. The aqueous morphogen formulation can be made, for 
45 example, by dissolving the morphogen in 50% ethanol containing acetonitrile in 0.1 % trifluoroacetic acid (TFA) or 0.1 % 
HCI, or equivalent solvents. One volume of the resultant solution then is added, for example, to ten volumes of phos- 
phate buffered saline (PBS), which further can include 0.1 -0.2% human serum albumin (HSA) or another acceptable 
carrier protein. The resultant solution preferably is vortexed extensively. The morphogen further can be dispersed in 
and associated with a carrier capable of maintaining the morphogen at the administered locus. Useful formulations for 
so some embodiments herein include viscous compositions and evaporative compositions. Biocompatible compositions 
that increase viscosity of the formulation include glycerol, polyalkylene glycols such as polyethylene glycol, oils of 
vegetable origin, hydrogenated naphthalenes, and the like. Useful evaporative compositions include physiologically 
acceptable, e.g., biologically inert, liquids that evaporate under physiological conditions so as to leave a residue of 
morphogen on the tissue surface. Evaporative liquids include low molecular weight organic or inorganic compounds 
55 such as water, ethanol, isopropanol, acetic acid and the like that do not adversely affect tissue function or tissue struc- 
tural integrity prior to evaporating. 

[0059] The formulation also can include an in vivo bioresorbable carrier material that acts as a controlled release 
delivery vehicle. Useful carriers can include biocompatible, preferably biodegradable structural components from, e. 
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g., an extracellular matrix, such as collagen, laminin, hyaluronic acid, and the like, or polymeric materials, such as 
polylactic, polybutyric and polyglycolic acids. The carrier also can comprise an acellular tissue matrix, substantially 
depleted in nonstructural components, such as a demineralized, guanidine-extracted bone, dentine, periodontal liga- 
ment or cementum matrix. Details for preparing such matrices are disclosed in U.S.S.N. 07/752,764 (published as 
W092/15323). Other useful controlled release carriers in which the morphogen can be dispersed are described in U. 
S. Pat. Nos. 4,975,526 and 4,919,939. Such carriers are envisioned to be particularly useful where the morphogen is 
used to seal a cavity. 

[0060] Preferably, morphogen compositions that are viscous, evaporative or comprise a bioresorbable carrier are 
suitable for topical administration to a dentinal or gingival surface, and can inhibit recession or enhance regenerative 
healing of gingival tissue as well as stimulating morphogenesis of dentine tissue. 

[0061] If desired, a given morphogen can be made more soluble in the aqueous composition by association with a 
suitable molecule. For example, the pro form of a morphogenic protein typically is more soluble or dispersible in phys- 
iological solutions than the corresponding mature form, in fact, endogenous morphogens are thought to be transported 
(e.g., secreted and circulated) in the mammalian body in this form. This soluble form of the protein can be obtained 
from culture medium of morphogen-secreting mammalian cells e.g., cells transfected with nucleic acid encoding and 
competent to express the morphogen. Alternatively, a soluble species can be formulated by complexing the mature 
dimer (or an active fragment thereof) with a morphogen pro domain or a solubility-enhancing fragment thereof (de- 
scribed more fully below). Other components, including various serum proteins, also can be useful to enhance mor- 
phogen solubility. 

[0062] Finally, the morphogens or morphogen-stimulating agents provided herein can be administered alone or in 
combination with other molecules, particularly symptom alleviating cofactors. Useful pharmaceutical cofactors for mit- 
igating symptoms associated with damage to dental tissue include antiseptics, antibiotics, anaesthetics and analgesics. 
Preferred antiseptics for use in the present system include chlorhexidine and tibezonium iodide; preferred antibiotics 
include tetracycline, aminoglycosides such as neomycin, gentamycin, kanamycin, tobramycin, netilmicin, sisomicin, 
amicamycin, their sulfates or other derivatives, macrolides such as erythromycin, its salts and other derivatives, 
spiramycin, josamicin or miocamicin, penicillins such as ampicillin, amoxicillin and the like, and cephalosporins, for 
example, cefaclor, cefadroxil, cefazolin, cefoperazone, cefotaxime, cephalothin, cefalexin, ceforanide, cefonicide or 
ceftriaxone. Preferred anaesthetics/analgesics include amide-type local anaesthetics such as lidocaine, mepivacaine, 
pyrrocaine, bupivacaine, prilocaine, etidocaine, or other widely used anaesthetics such as procaine. 

[0063] Other cofactors include non-steroidal anti-inflammatory agents. However, the morphogens described herein 
themselves can modulate the body's inflammatory/immune response to an initial tissue injury. Specifically, and as 
described in detail in U.S.S.N. 08/165,511 (published as WO93/04692), in the presence of a morphogen, proinflam- 
matory effector cells induced to migrate to a site of tissue injury do not become significantly activated Without being 
limited to any given theory, it is thought that, in the presence of the morphogen, damaged tissue is induced to undergo 
a recapitulation of tissue morphogenesis, where progenitor cells are induced to proliferate and differentiate in a tissue- 
specific manner, and new, functional, organized tissue is formed to replace the damaged or lost tissue, rather than 
disorganized, fibrous scar tissue. 

[0064] The formulated compositions contain therapeutically effective amounts of the morphogen, e.g., amounts which 
provide appropriate concentrations of the morphogen to the dentinal surface for a time sufficient to stimulate morpho- 
genesis of dentine and/or production of dentine matrix apposite thereto. As will be appreciated by those skilled in the 
art, the concentration of the compounds described in a therapeutic composition of the present invention will vary de- 
pending upon a number of factors, including the biological efficacy of the selected morphogen, the chemical charac- 
teristics (e.g., hydrophobicity) of the compounds employed, the formulation of the compound excipients, the adminis- 
tration route, and the treatment envisioned, including whether reparative dentine is to be induced at a distance, e.g., 
up to about 0.5mm, from the site of application. The preferred dosage to be administered also is likely to depend on 
such variables such as the condition of the dental tissues particularly of the dentinal surface to which morphogen is to 
be applied, the size of the tooth or dentinal surface to be treated, extent of dental tissue loss or recession, and the 
overall health status of the particular patient. In general, 0.00001 -1 000 mg of morphogen are sufficient with 0.0001 -1 00 
mg being preferable and 0.001 to 10 mg being even more preferable for primate teeth, including human teeth. No 
obvious morphogen induced pathological lesions arise when mature morphogen (e.g., OP-1, 20 mg) is administered 
daily to normal growing rats for 21 consecutive days. Moreover, 10 mg systemic injections of morphogen (e.g., OP-1) 
injected daily for 10 days into normal newborn mice does not produce any gross abnormalities. 

[0065] Practice of the invention, including additional preferred aspects and embodiments thereof, will be still more 
fully understood from the following examples, which are presented herein for illustration only and should not be con- 
strued as limiting the invention in any way. 
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IV, Examples 

Example 1: Morphogen-lnduced Dentinogenesis in Mammalian Teeth 

[0066] The following studies demonstrate the efficacy of morphogens in inducing dentine tissue morphogenesis in 
model mammals. Human dental pulp has been observed to respond unpredictably to injury. Currently, this represents 
a basic clinical problem in dentistry. Accordingly, primates are used herein as model mammals for demonstration of 
dentine regeneration. Those skilled in the dental arts will understand and appreciate the correlation between human 
and nonhuman primate dental biology. 

[0067] Recombinant human osteogenic protein-1 (hOP-1 , Seq. ID No. 4), when applied to freshly cut primate residual 
dentine, stimulated significantly more reparative dentine formation than calcium hydroxide paste (a conventional treat- 
ment). The response to OP-1 was dependent upon the concentration applied to the tooth as a cavity liner as well as 
the thickness of the residual dentine. The response to calcium hydroxide similarly was dependent upon the thickness 
of residual dentine. 

[0068] Dentine matices have been shown to contain bone morphogenetic protein (BMP) activity (Bang and Urist 
(1967), 94 Arch. Surg. 781-789; Youmans and Urist (1967), 12 Arch . Oral. Biol. 999-1008; Butler et al. (1977), 56 J. 
Dent. Res . 288-232; Bessho etal. (1990), 70 J. Dent. Res. 171-175), growth factors (Finklemen etal. (1990), 5 J. Bone 
Min. Res. 717-723) and dentinogenic activity (Anneroth and Bang (1972), 23 Odont. Rev. 315-328; Nakashima, M. 
(1989), 5 Endodont. Dent. Traumat. 279-286; Nakashima, M. (1990), 35 Arch. Oral. Biol. 493-497; Nakashima, M. 
(1990), 35 Arch. Oral. Biol. 277-281; Tziafas and Kolokuris (1990), 69 J. Dent. Res. 75-81; Tziafas etal. (1992), 37 
Arch. Oral. Biol. 119-128; Smith et al. (1994), 39 Arch. Oral. Biol. 13-22). Impure extracts of dentine with BMP activity 
(Nakashima, M. (1 990), 35 Arch. Oral. Biol. 493-497; Nakashima, M. (1 990), 35 Arch. Oral. Biol. 277-281 ), recombinant 
BMP-2, and BMP-4 (Nakashima, M. (1994), 73 J. Dent. Res. 151 5-1522) and recombinant human osteogenic protein- 
1 (OP-1, BMP-7) (Rutherford etal. (1993), 38 Arch. Oral. Biol. 571-576; Rutherford et at, (1994), 39 Arch. Oral. Biol. 
833-838) induce reparative dentineogenesis when placed on partially amputated pulps in mature adult teeth, see also 
U.S.S.Nos. 07/752,764 (published as W092/15323) and 08/155,343 (published as WO94/06399). In addition, dental 
pulps (Vaino etal. (1993), 75 Cell. 45-58; Heikinheimo, H. (1994), 73 .J. Dent. Res. 590-597) or celts derived from 
dental pulps (Takeda etal. (1994), 15 Bone 467-470) differentially express some morphogen genes. Accordingly, the 
present study explored whether solubilized OP-1 induced dentine formation when placed on freshly cut dentine surfaces 
in monkey permanent teeth. 

[0069] Ninety (90) incisor, premolar and molar permanent teeth were anesthetized with Carbocaine (Cook-Waite) 
without vasoconstrictor, isolated by rubber dam, cleaned with a coolant. The variation in the area of the pulpal floors 
was less than 10% and the mean thickness of the residual floor dentine varied from approximately 0.1 to 0.9 mm 
between different teeth (as measured histomorphometrically). The pulpal floors were covered a fixed volume of an 
evaporative solution containing 0.01, 0.1, 1 or lOpg OP-1 in acid-alcohol (28.5% ethanol, 0.025% HCL), acid-alcohol 
alone, a thin layer of calcium hydroxide paste (Dycal, L.D. Caulk, Wilmington DE) or filled without a liner (no treatment). 
The cavities were filled with Ketac Silver (ESPE-Premier, Norristown, PA) according to standard reconstructive tech- 
niques. It will be recognized that any standard dental reconstructive material could be used. The animals were euth- 
anized two months following surgery, specimens obtained and analyzed as described in the literature (Rutherford et 
al. (1 993), 38 Arch. Oral. Biol . 571 -576. 

[0070] All procedures described above and involving animals were approved by and performed in an accredited 
animal care facility with extensive experience managing non-human primates. These studies were conducting using 
5 adolescent (mixed dentition) male Macaca fasicularis of approximately 4-6 kg each. Dental procedures were per- 
formed on animals heavily sedated with, e.g., ketamine (15 mg/kg body wt.) and acepromazine (0.55 mg/kg body wt) 
supplemented with local intraoral infiltration anesthesia (without vasoconstrictor). 

[0071 ] The variable amounts of reparative dentine observed in this study typically were limited in area to the dentinal 
surface transverse to the luminae of cut dentinal canaliculi. FIGURE 3 is a digitized video image of a typical tissue 
section prepared from an OP-1 treated animal by standard histological techniques. FIGURE 3 shows that reparative 
dentine formed deep to those dentinal canaliculi cut during preparation of the tooth. In most cases, the reparative 
dentine was present in all sections in which both the pulpal floor of the cavity preparation and the subjacent pulp 
chamber were evident. The spatial relationship of the mass of reparative dentine to the pulpal floor appeared to be 
governed by the orientation of the dentinal canaliculi to the long axis of the tooth and to the surface area of cut dentine 
intersecting the canaliculi. This spatial orientation suggests that OP-1 diffused through the dentinal canaliculi. 

[0072] Indeed, the area of new dentine formation two months after morphogen treatment further depended on the 
dose of OP-1 applied. FIGURE 4 shows histomorphometric results illustrating this relationship. The mean thickness 
of the residual dentine was determined by averaging three separate and representative histomorphometric measure- 
ments in each of 5 sections distributed over 75% of the surface area of the cavity preparation. In FIGURE 4, the mean 
area of reparative dentine was determined by averaging three replicate histomorphometric measurements in each of 
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five (5) tissue sections distributed over 75% of the surface area of the cavity preparation. In contrast, there were no 
significant differences between the amount of reparative dentine deep to the cut dentinal canaliculi in teeth to which 
no liners were applied (no treatment) and those treated with evaporative carrier alone. 

[0073] As shown in FIGURES 5 and 6, equivalent amounts of OP-1 (e.g., 1 Opg in fixed equivalent volumes per tooth) 
5 stimulated significantly more reparative dentine two months after treatment than all other treatments attempted, in- 
cluding calcium hydroxide. The degree of stimulation related to the thickness of residual dentine separating the site of 
morphogen application from the pulp chamber wall, and became particularly evident as the thickness of residual dentine 
approached 0.2 mm. Each graphed residual dentine value (0.2, 0.45, 0.75 and 0.9 mm) represents a group of calculated 
values which ranged up to ± 0.15mm. Thus, the area of reparative dentine present two (2) months after lining the 
10 cavities with 10 pg OP-1 , a thin layer of calcium hydroxide, or evaporative carrier alone is expressible as a function of 
the thickness of the residual dentine remaining in the pulpal floor. More reparative dentine was present in OP-1 treated 
than calcium hydroxide treated teeth (ANOVA, Scheffe's F, P<0.05), in calcium hydroxide than carrier treated teeth 
(P<0.05), and in OP-1 than carrier treated teeth (P<0.01). OP-1 at Ipg and calcium hydroxide were equipotent over 
the range of thicknesses of residual dentine (not shown). Smaller amounts of OP-1 were poorly effective in cavities of 
is the size assessed in this study. 

[0074] Resection of the dentinal canaliculi may result in odontoblast death, particularly in the deeper preparations 
(Lee et al. (1992), AM. J. Den. 64-68). However, it is possible that the tooth preparation procedure utilized preserved 
odontoblasts even in the deepest preparations (Smith et al. (1994), 39 Arch . Oral. Biol. 13-22). Hence, the dentine 
formed in these studies may be reactionary dentine, formed by stimulation of the phenotypic function the original odon- 
20 toblasts, or reparative dentine formed by newly differentiated cells deep to the lost odontoblasts (Lesot et al. (1993), 
3 Cells and Materials 201-21 7; Smith et al. (1 994), 39 Arch. Oral. Biol. 1 3-22). The design utilized in these studies did 
not permit temporal observations of the odontoblast layer deep to the cut dentinal canaliculi. Earlier studies demon- 
strated the capacity of OP-1 complexed to an insoluble collagen-based carrier to stimulate reparative dentine when 
placed directly upon partially amputated pulps (U.S.S.N. 08/155,343 (published as WO94/06399) and Rutherford et 
25 al. (1 993), 38 Arch. Oral. Biol. 571 -576; Rutherford et al (1 994), 39 Arch. Oral. Biol. 833-838). Partial pulp amputation 
obviously removes the layer of odontoblasts, exposing the deeper fibrous connective tissue of the pulp. Human pulp 
cells are responsive to OP-1 in vitro, further suggesting that pulp itself contains responsive (competent) cells. The 
specific phenotypes of these OP-1 responsive pulp cells have not yet been identified conclusively. 

30 Example 2. Preparation of Soluble Morphogen Complexes useful in Stimulating Dentineoqesis 

[0075] A currently preferred form of the morphogen useful herein, having improved solubility in aqueous solutions, 
is a dimeric morphogenic protein comprising at least the C-terminal seven cysteine domain characteristic of the mor- 
phogen family, complexed with a peptide comprising a pro region of a member of the morphogen family, or a solubility- 
35 enhancing fragment thereof, or an allelic, species or other sequence variant thereof. Preferably, the dimeric morpho- 
genic protein is complexed with two pro region peptides. Also, the dimeric morphogenic protein preferably is noncov- 
alently complexed with the pro region peptides. The pro region peptides preferably comprise at least the N-terminal 
eighteen amino acids that define the pro domain of a given naturally occurring morphogen, or an allelic or phylogenetic 
counterpart variant thereof. In other preferred embodiments, peptides defining substantially the full length pro domain 
40 are used. 

[0076] Other soluble forms of morphogens include dimers of the uncleaved pro forms of these proteins, as well as 
"hemi-dimers" wherein one subunit of the dimer is an uncleaved pro form of the protein, and the other subunit comprises 
the mature form of the protein, including truncated forms thereof, preferably noncovalently associated with a cleaved 
pro domain peptide. 

45 [0077] As described above and in U.S.S.N. 08/040,510 (published as W094/03600, useful pro domains include the 

full length pro regions, as well as various truncated forms hereof, particularly truncated forms cleaved at proteolytic 
Arg-Xaa-Xaa-Arg (Seq. ID No. 32) cleavage sites within the pro domain polypeptidle. For example, in OP-1, possible 
pro sequences include sequences defined by residues 30-292 (full length form); 48-292; and 158-292. Soluble OP-1 
complex stability is best enhanced when the pro region comprises the full length form rather than a truncated form, 
50 such as the residues 48-292 truncated form, in that residues 30-47 show sequence homology to the N-terminal portions 
of other morphogens, and currently are believed to have particular utility in enhancing complex stability for all morph- 
ogens. Accordingly, currently preferred pro domains include peptides comprising at least the N-terminal fragment, e. 
g., amino acide residues 30-47 of a naturally occurring morphogen pro domain, or a biosynthetic variant thereof that 
retains the solubility and/or stability enhancing properties of the naturally-occurring peptide. 

55 [0078] As will be appreciated by those having ordinary skill in the art, useful sequences encoding the pro region can 

be obtained from genetic sequences encoding known morphogens. Alternatively, chimeric pro regions can be con- 
structed from the sequences of one or more known morphogens. Still another option is to create a synthetic sequence 
variant of one or more known pro region sequences. 
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[0079] In another preferred aspect, useful pro region peptides include polypeptide chains comprising an amino acid 
sequence encoded by a nucleic acid that hybridizes under stringent conditions with a DNA or RNA sequence encoding 
at least the N-terminal eighteen amino acids of the pro region sequence for OP1 or OP2, e.g., nucleotides 136-192 
and 152-211 of Seq. ID No. .15 and 19, respectively. 

5 

2.1 Isolation of soluble morphogen complex from conditioned media or body fluid 

[0080] Morphogens are expressed from mammalian cells as soluble complexes. Typically, however the complex is 
disassociated during purification, generally by exposure to denaturants often added to the purification solutions, such 
io as detergents, alcohols, organic solvents, chaotropic agents and compounds added to reduce the pH of the solution. 
Provided below is a currently preferred protocol for purifying the soluble proteins from conditioned media (or, optionally, 
a body fluid such as serum, cerebrospinal or peritoneal fluid), under non-denaturing conditions. The method is rapid, 
reproducible and yields isolated soluble morphogen complexes in substantially pure form. 

[0081] Soluble morphogen complexes can be isolated from conditioned media using a simple, three step chroma- 
is tographic protocol performed in the absence of denaturants. The protocol involves running the media (or body fluid) 
over an affinity column, followed by ion exchange and gel filtration chromatographies. The affinity column described 
below is a Zn-I MAC column. The present protocol has general applicability to the purification of a variety of morphogens, 
all of which are anticipated to be isolatable using only minor modifications of the protocol described below. An alternative 
protocol also envisioned to have utility includes an immunoaffinity column, created using standard procedures and, for 
20 example, using antibody specific for a given morphogen pro domain (complexed, for example, to a protein A-conjugated 
Sepharose column). Protocols for developing immunoaffinity columns are well described in the art, (see, for example, 
Guide to Protein Purification, M. Deutscher, ed., Academic Press, San Diego, 1990, particularly sections VII and XI 
thereof). 

[0082] In this study, OP-1 was expressed in mammalian (CHO, Chinese hamster ovary) cells as described in the art 
25 (see, for example, international application US90/05903 (W091/05802). The CHO cell conditioned media containing 
0.5% FBS was initially purified using Immobilized Metal-Ion Affinity Chromatography (IMAC). The soluble OP-1 complex 
from conditioned media binds very selectively to the Zn-IMAC resin and a high concentration of imidazole (50 mM 
imidazole, pH 8.0) is required for the effective elution of the bound complex. The Zn-IMAC step separates the soluble 
OP-1 from the bulk of the contaminating serum proteins that elute in the flowthrough and 35 mM imidazole wash 
30 fractions. The Zn-IMAC purified soluble OP-1 is next applied to an S-Sepharose cation-exchange column equilibrated 
in 20 mM NaP0 4 (pH 7.0) with 50 mM NaCI. This S-Sepharose step serves to further purify and concentrate the soluble 
OP-1 complex in preparation for the following gel filtration step. The protein was applied to a Sephacryl S-200HR 
column equilibrated in TBS. Using substantially the same protocol, soluble morphogens also can be isolated from one 
or more body fluids, including serum, cerebrospinal fluid or peritoneal fluid. 

35 [0083] IMAC was performed using Chelating-Sepharose (Pharmacia) that had been charged with three column vol- 

umes of 0.2 M ZnS0 4 . The conditioned media was titrated to pH 7.0 and applied directly to the ZN-IMAC resin equil- 
ibrated in 20 mM HEPES (pH 7.0) with 500 mM NaCI. The Zn-IMAC resin was loaded with 80 mL of starting conditioned 
media per mL of resin. After loading, the column was washed with equilibration buffer and most of the contaminating 
proteins were eluted with 35 mM imidazole (pH 7.0) in equilibration buffer. The soluble OP-1 complex then is eluted 
40 with 50 mM imidazole (pH 8.0) in 20 mM HEPES and 500 mM NaCI. 

[0084] The 50 mM imidazole eluate containing the soluble OP-1 complex was diluted with nine volumes of 20 mM 
NaP0 4 (pH 7.0) and applied to an S-Sepharose (Pharmacia) column equilibrated in 20 mM NaP0 4 (pH 7.0) with 50 
mM NaCI. The S-Sepharose resin was loaded with an equivalent of 800 mL of starting conditioned media per mL of 
resin. After loading the S-Sepharose column was washed with equilibration buffer and eluted with 1 00 mM NaCI followed 
45 by 300 mM and 500 mM NaCI in 20 mM NaP04 (pH 7.0). The 300 mM NaCI pool was further purified using gel filtration 
chromatography. Fifty mis of the 300 mm NaCI eluate was applied to a 5.0 X 90 cm Sephacryl S-200HR (Pharmacia) 
equilibrated in Tris buffered saline (TBS), 50 mM Tris, 150 mM NaCI (pH 7.4). The column was eluted at a flow rate of 
5 mL/minute collecting 10 mL fractions. The apparent molecular of the soluble OP-1 was determined by comparison 
to protein molecular weight standards (alcohol dehydrogenase (ADH, 1 50 kDa), bovine serum albumin (BSA, 68 kDa), 
so carbonic anhydrase (CA, 30 kDa) and cytochrome C (cyt C, 12.5 kDa). The purity of the S-200 column fractions was 
determined by separation on standard 15% polyacrylamide SDS gels stained with coomassie blue. The identity of the 
mature OP-1 and the pro-domain was determined by N-terminal sequence analysis after separation of the mature OP- 
1 from the pro-domain using standard reverse phase Cl 8 HPLC. 

[0085] The soluble OP-1 complex elutes with an apparent molecular weight of 110 kDa. This agrees well with the 
55 predicted composition of the soluble OP-1 complex with one mature OP-1 dimer (35-36 kDa) associated with two pro- 
domains (39 kDa each). Purity of the final complex can be verified by running the appropriate fraction in a reduced 
1 5% polyacrylamide gel. 

[0086] The complex components can be verified by running the complex-containing fraction from the S-200 or S- 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



200HR columns over a reverse phase Cl 8 HPLC column and eluting in an acetonitrile gradient (in 0.1% TFA), using 
standard procedures. The complex is dissociated by this step, and the pro domain and mature species elute as separate 
species. These separate species then can be subjected to N-terminal sequencing using standard procedures (see, for 
example, Guide to Protein Purification, M. Deutscher, ed., Academic Press, San Diego, 1 990, particularly pp. 602-61 3), 
and the identity of the isolated 36kD, 39kDa proteins confirmed as mature morphogen and isolated, cleaved pro domain, 
respectively. N-terminal sequencing of the isolated pro domain from mammalian cell produced OP-1 revealed 2 forms 
of the pro region, the intact form (beginning at residue 30 of Seq. ID No. 16) and a truncated form, (beginning at residue 
48 of Seq. ID No. 16.) N-terminal sequencing of the polypeptide subunit of the isolated mature species reveals a range 
of N-termini for the mature sequence, beginning at residues 293, 300, 313, 315, 316, and 318, of Seq. ID No. 16, all 
of which are active as demonstrated by the standard bone morphogenesis assay set forth in U.S.S.N. 07/752,764 
(published as W092/1 5323). 

2.2 In Vitro Soluble Morphogen Complex Formation 

[0087] As an alternative to purifying soluble complexes from culture media or a body fluid, soluble complexes can 
be formulated from purified pro domains and mature dimeric species. Successful complex formation apparently requires 
association of the components under denaturing conditions sufficient to relax the folded structure of these molecules, 
without affecting disulfide bonds. Preferably, the denaturing conditions mimic the environment of an intracellular vesicle 
sufficiently such that the cleaved pro domain has an opportunity to associate with the mature dimeric species under 
relaxed folding conditions. The concentration of denaturant in the solution then is decreased in a controlled, preferably 
step-wise manner, so as to allow proper refolding of the dimer and pro regions while maintaining the association of the 
pro domain with the dimer. Useful denaturants include 4-6M urea or guanidine hydrochloride (GuHCI), in buffered 
solutions of pH 4-10, preferably pH 6-8. The soluble complex then is formed by controlled dialysis or dilution into a 
solution having a final denaturant concentration of less than 0.1-2M urea or GuHCI, preferably 1-2 M urea of GuHCI, 
which then preferably can be diluted into a physiological buffer. Protein purification/renaturing procedures and consid- 
erations are well described in the art, and details for developing a suitable renaturing protocol readily can be determined 
by one having ordinary skill in the art. One useful text on the subject is Guide to Protein Purification, M. Deutscher, 
ed., Academic Press, San Diego, 1990, particularly section V Complex formation also may be aided by addition of one 
or more chaperone proteins. 

2.3 Stability of Soluble Morphogen Complexes 

[0088] The stability of the highly purified soluble morphogen complex in a physiological buffer, e.g., tris-buffered 
saline (TBS) and phosphate-buffered saline (PBS), can be enhanced by any of a number of means. Currently preferred 
is by means of a pro region that comprises at least the first 18 amino acids of the pro sequence (e.g., residues 30-47 
of Seq. ID NO. 16 for OP-1), and preferably is the full length pro region. Residues 30-47 show sequence homology to 
the N-terminal portion of other morphogens and are believed to have particular utility in enhancing complex stability 
for all morphogens. Other useful means for enhancing the stability of soluble morphogen complexes include three 
classes of additives. These additives include basic amino acids (e.g., L-arginine, lysine and betaine); nonionic deter- 
gents (e.g., Tween 80 or Nonldet P-120); and carrier proteins (e.g., serum albumin and casein). Useful concentrations 
of these additives include 1 -1 00 mM, preferably 1 0-70 mM, including 50 mM, basic amino acid;, 0.01-1 .0%, preferably 
0.05-0.2%, including 0.1% (v/v) nonionic detergent;, and 0.01-1 .0%, preferably 0.05-0.2%, including 0.1% (w/v) carrier 
protein. 

SEQUENCE LISTING 
[0089] 

(1 ) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: CREATIVE BIOMOLECULES, INC 

(B) STREET: 45 SOUTH STREET 

(C) CITY: HOPKINTON 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 01748 
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(G) TELEPHONE: 1-508-435-9001 

(H) TELEFAX: 1-508-435-0454 

(I) TELEX: 

5 (ii) TITLE OF INVENTION: MORPHOGEN-INDUCED DENTINE REGENERATION 

(iii) NUMBER OF SEQUENCES: 32 

(iv) CORRESPONDENCE ADDRESS: 

10 

(A) ADDRESSEE: CREATIVE BIOMOLECULES, INC 

(B) STREET: 45 SOUTH STREET 

(C) CITY: HOPKINTON 

(D) STATE: MA 

15 (E) COUNTRY: USA 

(F) ZIP: 01748 

(v) COMPUTER READABLE FORM: 

20 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1 .0, Version #1 .30 

25 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

30 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: FENTON, GILLIAN M 

(B) REGISTRATION NUMBER: 36,508 

35 (C) REFERENCE/DOCKET NUMBER: CRP-088PC 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 248-7000 
<■ o (B) TELEFAX: (617) 248-7100 

(2) INFORMATION FOR SEQ ID NO:1 : 

(i) SEQUENCE CHARACTERISTICS: 

45 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

50 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

55 (A) NAME/KEY: Protein 

(B) LOCATION: 1..97 

(D) OTHER INFORMATION: /labels Generic-Seq-7 

/note= "wherein each Xaa is independently selected from a group of one or more specified amino acids 
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10 



15 



as defined in the specification." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 



Leu Xaa Xaa Xaa Phe Xaa Xaa Xaa Gly Trp Xaa Xaa Xaa Xaa Xaa Xaa 

1 5 10 15 

Pro Xaa Xaa Xaa Xaa Ala Xaa Tyr Cys Xaa Gly Xaa Cys Xaa Xaa Pro 

20 25 30 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn His Ala Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Cys Xaa Pro 
50 55 60 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

65 70 75 80 

Val Xaa Leu Xaa Xaa Xaa Xaa Xaa Met Xaa Val Xaa Xaa Cys Xaa Cys 

85 90 95 

Xaa 



25 ( 2 ) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 
30 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

35 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1 ..102 

40 (D) OTHER INFORMATION: /label= Generic-Seq -8 

/note= "wherein each Xaa is independently selected from a group of one or more specified amino acids 
as defined in the specification." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 : 

45 

Cys Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Phe Xaa Xaa Xaa Gly Trp Xaa 
1 5 10 15 

50 Xaa Xaa Xaa Xaa Xaa Pro Xaa Xaa Xaa Xaa Ala Xaa Tyr Cys Xaa Gly 

20 25 30 

Xaa Cys Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asn His Ala 
35 40 45 

55 
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Xaa 


Xaa 

50 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 

55 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 

60 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 

65 


Cys 


Cys 


Xaa 


Pro 


Xaa 

70 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 

75 


Xaa 


Xaa 


Leu 


Xaa 


Xaa 

80 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 

85 


Val 


Xaa 


Leu 


Xaa 


Xaa 

90 


Xaa 


Xaa 


Xaa 


Met 


Xaa 

95 


Val 


Xaa 


Xaa 


Cys 


Xaa 

100 


Cys 


Xaa 























(2) INFORMATION FOR SEQ ID NO:3: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 02 amino acids 

(B) TYPE: amino acid 

20 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

25 (ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /label= OPX 

30 /note= “wherein each Xaa is independently selected from a group of one or more specified amino acids 

as defined in the specification." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 



35 





Cys 

1 


Xaa 


Xaa 


His 


Glu 

5 


Leu 


Tyr Val 


Xaa 


Phe 

10 


Xaa 


Asp 


Leu 


Gly Trp Xaa 
15 


40 


Asp 


Trp 


Xaa 


lie 

20 


Ala 


Pro 


Xaa Gly Tyr 
25 


Xaa 


Ala Tyr 


Tyr 


Cys Glu Gly 
30 




Glu 


Cys 


Xaa 

35 


Phe 


Pro 


Leu 


Xaa 


Ser 

40 


Xaa 


Met 


Asn 


Ala 


Thr 

45 


Asn His Ala 


45 


lie 


Xaa 

50 


Gin 


Xaa 


Leu Val 


His 

55 


Xaa 


Xaa 


Xaa 


Pro 


Xaa 

60 


Xaa 


Val Pro Lys 




Xaa 

65 


Cys 


Cys 


Ala 


Pro 


Thr 

70 


Xaa 


Leu 


Xaa 


Ala 


Xaa 

75 


Ser 


Val 


Leu Tyr Xaa 
80 


50 


Asp 


Xaa 


Ser 


Xaa 


Asn 

85 


Val 


Xaa 


Leu 


Xaa 


Lys 

90 


Xaa 


Arg 


Asn 


Met Val Val 
95 




Xaa 


Ala 


Cys 


Gly 

100 


Cys 


His 



















55 

(2) INFORMATION FOR SEQ ID NO:4: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
5 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

io (A) ORGANISM: Homo sapiens 

. (F) TISSUE TYPE: HIPPOCAMPUS 

(ix) FEATURE: 

15 (A) NAME/KEY: Protein 

(B) LOCATION: 1.. 139 

(D) OTHER INFORMATION: /label= hOPI -MATURE 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

20 





Ser 


Thr 


Gly 


Ser 


Lys 


Gin Arg 


Ser 


Gin 


Asn 


Arg 


Ser 


Lys 


Thr 


Pro 


Lys 




1 








5 










10 










15 




25 


Asn 


Gin 


Glu 


Ala 


Leu 


Arg 


Met 


Ala 


Asn 


Val 


Ala 


Glu 


Asn 


Ser 


Ser 


Ser 










20 










25 










30 








Asp 


Gin 


Arg 


Gin 


Ala 


Cys 


Lys 


Lys 


His 


Glu 


Leu 


Tyr 


Val 


Ser 


Phe Arg 








35 










40 










45 








30 




































Asp 


Leu 


Gly 


Trp 


Gin 


Asp 


Trp 


lie 


lie 


Ala 


Pro 


Glu 


Gly Tyr Ala Ala 






50 










55 










60 












Tyr 


Tyr 


Cys 


Glu 


Gly Glu 


Cys 


Ala 


Phe 


Pro 


Leu 


Asn 


Ser 


Tyr Met 


Asn 


35 


€5 










70 










75 










80 




Ala 


Thr 


Asn 


His 


Ala 


He 


Val 


Gin 


Thr 


Leu 


Val 


His 


Phe 


lie 


Asn 


Pro 












85 










90 










95 






Glu 


Thr 


Val 


Pro 


Lys 


Pro 


Cys 


Cys 


Ala 


Pro 


Thr 


Gin 


Leu 


Asn 


Ala 


lie 


40 








100 










105 










110 








Ser 


Val 


Leu 


Tyr 


Phe 


Asp 


Asp 


Ser 


Ser 


Asn 


Val 


lie 


Leu 


Lys 


Lys 


Tyr 








115 










120 










125 








45 


Arg 


Asn 


Met 


Val 


Val 


Arg 


Ala 


Cys 


Gly Cys 


His 













130 135 

(2) INFORMATION FOR SEQ ID NO:5: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(vi) ORIGINAL SOURCE: 

(A).ORGANISM: MURIDAE 
(F) TISSUE TYPE: EMBRYO 

5 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1.. 139 

10 (D) OTHER INFORMATION: /labels mOPI -MATURE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 



15 


Ser 

1 


Thr Gly Gly 


Lys 

5 


Gin Arg 


Ser 


Gin 


Asn 

10 


Arg 


Ser 


Lys 


Thr 


Pro 

15 


Lys 




Asn 


Gin Glu 


Ala 

20 


Leu 


Arg 


Met 


Ala 


Ser 

25 


Val 


Ala 


Glu 


Asn 


Ser 

30 


Ser 


Ser 


20 


Asp 


Gin Arg 
35 


Gin 


Ala 


Cys 


Lys 


Lys 

40 


His 


Glu 


Leu 


Tyr Val 
45 


Ser 


Phe 


Arg 


25 


Asp 


Leu Gly Trp 
50 


Gin 


Asp 


Trp 

55 


He 


lie 


Ala 


Pro 


Glu Gly Tyr Ala 
60 


Ala 




Tyr 

65 


Tyr Cys 


Glu 


Gly 


Glu 

70 


Cys 


Ala 


Phe 


Pro 


Leu 

75 


Asn 


Ser 


Tyr Met 


Asn 

80 


30 


Ala 


Thr Asn 


His 


Ala 

85 


lie 


val 


Gin 


Thr 


Leu 

90 


Val 


His 


Phe 


lie 


Asn 

95 


Pro 




Asp 


Thr Val 


Pro 

100 


Lys 


Pro 


Cys 


Cys 


Ala 

105 


Pro 


Thr 


Gin 


Leu 


Asn 

110 


Ala 


lie 


35 


Ser 


Val Leu 
115 


Tyr 


Phe 


Asp Asp 


Ser 

120 


Ser 


Asn 


Val 


lie 


Leu 

125 


Lys 


Lys 


Tyr 




Arg 


Asn Met 


Val 


Val 


Arg 


Ala 


Cys 


Gly Cys 


His 













130 135 

40 (2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 
45 (B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

50 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 
(F) TISSUE TYPE: HIPPOCAMPUS 

55 

(ix) FEATURE: 

(A) NAME/KEY: Protein 
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5 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



(B) LOCATION: 1 ..139 

(D) OTHER INFORMATION: /iabel= HOP2-MATURE 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 



Ala Val Arg Pro Leu Arg Arg Arg Gin Pro Lys Lys Ser Asn Glu Leu 

1 5 10 15 

Pro Gin Ala Asn Arg Leu Pro Gly lie Phe Asp Asp Val His Gly Ser 

20 25 30 



His Gly Arg Gin Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gin 
35 40 45 

Asp Leu Gly Trp Leu Asp Trp Val lie Ala Pro Gin Gly Tyr Ser Ala 
50 55 60 

Tyr Tyr Cys Glu Gly Glu Cys Ser Phe Pro Leu Asp Ser Cys Met Asn 

65 70 75 80 

Ala Thr Asn His Ala lie Leu Gin Ser Leu Val His Leu Met Lys Pro 

85 90 95 

Asn Ala Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr 
100 105 110 

Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val He Leu Arg Lys His 
115 120 125 

Arg Asn Met Val Val Lys Ala Cys Gly Cys His 
130 135 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: MURIDAE 
(F) TISSUE TYPE: EMBRYO 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1 ..139 

(D) OTHER INFORMATION: /label= MOP2-MATURE 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
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5 



10 



15 



20 



25 



30 



35 



40 



45 



50 



Ala Ala Arg Pro Leu Lys Arg Arg Gin Pro Lys Lys Thr Asn Glu Leu 

1 5 10 15 

Pro His Pro Asn Lys Leu Pro Gly lie Phe Asp Asp Gly His Gly Ser 

20 25 30 

Arg Gly Arg Glu Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Arg 
35 40 45 

Asp Leu Gly Trp Leu Asp Trp Val lie Ala Pro Gin Gly Tyr Ser Ala 
50 55 60 

Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asp Ser Cys Met Asn 

65 70 75 80 

Ala Thr Asn His Ala lie Leu Gin Ser Leu Val His Leu. Met Lys Pro 

85 90 95 



Asp Val Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr 
100 105 no 

Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val He Leu Arg Lys His 
115 120 125 

Arg Asn Met Val Val Lys Ala Cys Gly Cys His 
130 135 



(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bovinae 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1 ..101 

(D) OTHER INFORMATION: /labe!= CBMP-2A-FX 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 



55 
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Cys 

1 


Lys 


Arg 


His 


Pro 

5 


Leu 


Tyr Val 


Asp Phe 
10 


Ser 


Asp Val 


Gly Trp 
15 


Asn 


5 


Asp 


Trp 


He 


Val 

20 


Ala 


Pro 


Pro 


Gly Tyr His 
25 


Ala 


Phe 


Tyr 


Cys 

30 


His 


Gly 




Glu 


Cys 


Pro 

35 


Phe 


Pro 


Leu 


Ala 


Asp His 
40 


Leu 


Asn 


Ser 


Thr 
45 . 


Asn 


His 


Ala 


10 


lie 


Val 

50 


Gin 


Thr 


Leu 


Val 


Asn 

55 


Ser 


Val 


Asn 


Ser 


Lys 

60 


lie 


Pro 


Lys 


Ala 


15 


Cys 

65 


Cys 


Val 


Pro 


Thr 


Glu 

70 


Leu 


Ser 


Ala 


lie 


Ser 

75 


Met 


Leu Tyr 


Leu 


Asp 

80 




Glu 


Asn 


Glu 


Lys 


Val 

85 


Val 


Leu 


Lys 


Asn 


Tyr 

90 


Gin Asp 


Met 


Val 


Val 

95 


GlU 



Gly Cys Gly Cys Arg 
20 100 



(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

25 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
30 

(ii) MOLECULE TYPE: protein 
(vi) ORIGINAL SOURCE: 

35 (A) ORGANISM: HOMO SAPIENS 

(F) TISSUE TYPE: hippocampus 

(ix) FEATURE: 

40 (A) NAME/KEY: Protein 

(B) LOCATION: 1 ..101 

(D) OTHER INFORMATION: /label= CBMP-2B-FX 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

45 



50 



55 
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10 



15 



20 



Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn 

1 5 10 15 

Asp Trp lie Val Ala Pro Pro Gly Tyr Gin Ala Phe Tyr Cys His Gly 

20 25 30 

Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Ash His Ala 
35 40 45 

lie Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser lie Pro Lys Ala 
50 55 60 

Cys Cys Val Pro Thr Glu Leu Ser Ala He Ser Met Leu Tyr Leu Asp 

55 70 75 80 

Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Met Val Val Glu 

85 90 95 

Gly Cys Gly Cys Arg 
100 



(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

25 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
30 

(ii) MOLECULE TYPE: protein 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: DROSOPHILA MELANOGASTER 
35 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

40 (D) OTHER INFORMATION: /label= DPP-FX 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 



45 Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asp 

15 10 15 

Asp Trp lie Val Ala Pro Leu Gly Tyr Asp Ala Tyr Tyr Cys His Gly 
20 25 30 

50 



55 
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Lys 


Cys 


Pro 

35 


Phe Pro Leu 


Ala 


Asp 

40 


His 


Phe 


Asn 


Ser 


Thr 

45 


Asn 


His 


Ala 


5 


Val 


Val 

50 


Gin 


Thr Leu Val 


Asn 

55 


Asn 


Asn 


Asn 


Pro Gly Lys 
60 


Val 


Pro 


Lys 




Ala 

65 


Cys 


Cys 


Val Pro Thr 
70 


Gin 


Leu Asp 


Ser 


Val 

75 


Ala 


Met 


Leu 


Tyr 


Leu 

80 


10 


Asn 


Asp 


Gin 


Ser Thr Val 
85 


Val 


Leu 


Lys 


Asn 

90 


Tyr 


Gin 


Glu 


Met 


Thr 

95 


Val 




Val 


Gly 


Cys 


Gly Cys Arg 























(2) INFORMATION FOR SEQ ID NO:11: 
(i) SEQUENCE CHARACTERISTICS: 



20 



25 



(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: XENOPUS 



30 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1.102 

35 (D) OTHER INFORMATION: /labels VGL-FX 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11 : 



40 Cys Lys Lys Arg His Leu Tyr Val Glu Phe Lys Asp Val Gly Trp Gin 

1 5 10 15 

Asn Trp Val lie Ala Pro Gin Gly Tyr Met Ala Asn Tyr Cys Tyr Gly 

20 25 30 

45 Glu Cys Pro Tyr Pro Leu Thr Glu lie Leu Asn Gly Ser Asn His Ala 

35 40 45 

lie Leu Gin Thr Leu Val His Ser lie Glu Pro Glu Asp lie Pro Leu 
50 55 60 

50 

Pro Cys Cys Val Pro Thr Lys Met Ser Pro lie Ser Met Leu Phe Tyr 
65 70 75 80 

Asp Asn Asn Asp Asn Val Val Leu Arg His Tyr Glu Asn Met Ala Val 

55 85 90 95 

Asp Glu Cys Gly Cys Arg 
100 
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(2) INFORMATION FOR SEQ ID NO:12: 

: (i) SEQUENCE CHARACTERISTICS: 

5 (A) LENGTH: 1 02 amino acids' 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

io (ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: MURIDAE 

is ■ (ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1.. 102 

(D) OTHER INFORMATION: /label= VGR-1-FX 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 





Cys 


Lys 


Lys 


His 


Glu 


Leu 


Tyr Val 


Ser 


Phe 


Gin 


Asp Val 


Gly 


Trp 


Gin 


25 


1 








5 








10 








15 






Asp 


Trp 


lie 


lie 


Ala 


Pro 


Lys Gly Tyr Ala 


Ala 


Asn Tyr 


Cys 


Asp Gly 










20 








25 










30 






30 


Glu 


Cys 


Ser 


Phe 


Pro 


Leu 


Asn Ala 


His 


Met 


Asn 


Ala 


Thr 


Asn 


His 


Ala 








35 








40 










45 










He 


Val 


Gin 


Thr 


Leu 


Val 


His Val 


Met 


Asn 


Pro 


Glu 


Tyr 


Val 


Pro 


Lys 






50 










55 








60 






35 


Pro 


Cys 


Cys 


Ala 


Pro 


Thr 


Lys Val 


Asn 


Ala 


lie 


Ser 


Val 


Leu 


Tyr 


Phe 




65 










70 








75 








80 




Asp 


Asp 


Asn 


Ser 


Asn 


Val 


lie Leu 


Lys 


Lys 


Tyr 


Arg 


Asn 


Met 


Val 


Val 












85 








90 










95 




40 


Arg 


Ala 


Cys 


Gly 


Cys 


His 





















(2) INFORMATION FOR SEQ ID NO:13: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

50 (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(F) TISSUE TYPE: brain 
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5 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1 ..106 

(D) OTHER INFORMATION: /note= "GDF-1 (fx)" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 



Cys Arg Ala Arg Arg Leu Tyr Val Ser Phe Arg Glu Val Gly Trp His 

1 5 10 15 

Arg Trp Val lie Ala Pro Arg Gly Phe Leu Ala Asn Tyr Cys Gin Gly 

20 25 30 

Gin Cys Ala Leu Pro Val Ala Leu Ser Gly Ser Gly Gly Pro Pro Ala 
35 40 45 

Leu Asn His Ala Val Leu Arg Ala Leu Met His Ala Ala Ala Pro Glv 
50 55 60 

Ala Ala Asp Leu Pro Cys Cys Val Pro Ala Arg Leu Ser Pro He Ser 

65 70 75 80 

Val Leu Phe Phe Asp Asn Ser Asp Asn Val Val Leu Arg Gin Tyr Glu 

85 90 95 

Asp Met Val Val Asp Glu Cys Gly Cys Arg 

100 105 

(2) INFORMATION FOR SEQ ID NO:t 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 



Cys Xaa Xaa Xaa Xaa 
1 5 



(2) INFORMATION FOR SEQ ID NO:15: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 1822 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: HOMO SAPIENS 
(F) TISSUE TYPE: HIPPOCAMPUS 

(ix) FEATURE: 

5. 

(A) NAME/KEY: CDS 

(B) LOCATION: 49.. 1341 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /function= 

10 "OSTEOGENIC PROTEIN" 

/product= M OP1" 

/evidence^ EXPERIMENTAL 
/standard_name= "OP1° 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1 5: 



20 



25 



30 



35 



40 



45 



50 



55 
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5 



10 



15 



20 



.25 



30 



35 



40 



45 



50 



55 



GGTGCGGGCC CGGAGCCCGG AGCCCGGGTA GCGCGTAGAG CCGGCGCG ATG CAC GTG 57 

Met His Val 
1 



CGC 


TCA 


CTG 


CGA 


GCT 


GCG 


GCG 


CCG 


CAC 


AGC 


TTC 


GTG 


GCG 


CTC 


TGG 


GCA 


105 


Arg 


Ser 


Leu 


Arg 


Ala 


Ala 


Ala 


Pro 


His 


Ser 


Phe 


Val 


Ala 


Leu 


Trp 


Ala 






5 










10 










15 












ccc 


CTG 


TTC 


CTG 


CTG 


CGC 


TCC 


GCC 


CTG 


GCC 


GAC 


TTC 


AGC 


CTG 


GAC 


AAC 


153 


Pro 


Leu 


Phe 


Leu 


Leu 


Arg 


Ser 


Ala 


Leu 


Ala Asp 


Phe 


Ser 


Leu Asp 


Asn 




20 










25 










30 










35 




GAG 


GTG 


CAC 


TCG 


AGC 


TTC 


ATC 


CAC 


CGG 


CGC 


CTC 


CGC 


AGC 


CAG 


GAG 


CGG 


201 


Glu 


Val 


His 


Ser 


Ser 


Phe 


lie 


His 


Arg Arg 


Leu 


Arg 


Ser 


Gin 


Glu Arg 












40 










45 










50 






CGG 


GAG 


ATG 


CAG 


CGC 


GAG 


ATC 


CTC 


TCC 


ATT 


TTG 


GGC 


TTG 


ccc 


CAC 


CGC 


249. 


Arg Glu 


Met 


Gin Arg 


Glu 


lie 


Leu 


Ser 


lie 


Leu Gly 


Leu 


Pro 


His 


Arg 










55 










60 










65 








CCG 


CGC 


CCG 


CAC 


CTC 


CAG 


GGC 


AAG 


CAC 


AAC 


TCG 


GCA 


CCC 


ATG 


TTC 


ATG 


297 


Pro Arg 


Pro 


His 


Leu 


Gin 


Gly Lys 


His 


Asn 


Ser 


Ala 


Pro 


Met 


Phe 


Met 








70 










75 










80 










CTG 


GAC 


CTG 


TAC 


AAC 


GCC 


ATG 


GCG 


GTG 


GAG 


GAG 


GGC 


GGC 


GGG 


CCC 


GGC 


345 


Leu Asp 


Leu 


Tyr 


Asn 


Ala 


Met 


Ala 


Val 


Glu 


Glu 


Gly Gly Gly 


Pro Gly 






85 










90 










95 












GGC 


CAG 


GGC 


TTC 


TCC 


TAC 


CCC 


TAC 


AAG 


GCC 


GTC 


TTC 


AGT 


ACC 


CAG 


GGC 


393 


Gly Gin Gly 


Phe 


Ser 


Tyr 


Pro 


Tyr 


Lys 


Ala 


Val 


Phe 


Ser 


Thr 


Gin Gly 




100 










105 










110 










115 




CCC 


CCT 


CTG 


GCC 


AGC 


CTG 


CAA 


GAT 


AGC 


CAT 


TTC 


CTC 


ACC 


GAC 


GCC 


GAC 


441 


Pro 


Pro 


Leu 


Ala 


Ser 


Leu 


Gin 


Asp 


Ser 


His 


Phe 


Leu 


Thr 


Asp 


Ala 


Asp 












120 










125 










130 






ATG 


GTC 


ATG 


AGC 


TTC 


GTC 


AAC 


CTC 


GTG 


GAA 


CAT 


GAC 


AAG 


GAA 


TTC 


TTC 


489 


Met 


Val 


Met 


Ser 


Phe 


Val 


Asn 


Leu 


Val 


Glu 


His 


Asp 


Lys 


Glu 


Phe 


Phe 










135 










140 










14,5 








CAC 


CCA 


CGC 


TAC 


CAC 


CAT 


CGA 


GAG 


TTC 


CGG 


TTT 


GAT 


CTT 


TCC 


AAG 


ATC 


537 


His 


Pro 


Arg 


Tyr 


His 


His 


Arg 


Glu 


Phe 


Arg 


Phe 


Asp 


Leu 


Ser 


Lys 


lie 








150 










155 










160 










CCA 


GAA 


GGG 


GAA 


GCT 


GTC 


ACG 


GCA 


GCC 


GAA 


TTC 


CGG 


ATC 


TAC 


AAG 


GAC 


585 


Pro 


Glu 


Gly Glu 


Ala 


Val 


Thr 


Ala 


Ala 


Glu 


Phe 


Arg 


He 


Tyr 


Lys 


Asp 






165 










170 










175 












TAC 


ATC 


CGG 


GAA 


CGC 


TTC 


GAC 


AAT 


GAG 


ACG 


TTC 


CGG 


ATC 


AGC 


GTT 


TAT 


633 


Tyr 


He 


Arg Glu 


Arg 


Phe 


Asp Asn 


Glu 


Thr 


Phe Arg 


lie 


Ser 


Val 


Tyr 




180 










185 










190 










195 




CAG 


GTG 


CTC 


CAG 


GAG 


CAC 


TTG 


GGC 


AGG 


GAA 


TCG 


GAT 


CTC 


TTC 


CTG 


CTC 


681 


Gin 


Val 


Leu 


Gin 


Glu 


His 


Leu Gly 


Arg 


Glu 


Ser 


Asp 


Leu 


Phe 


Leu 


Leu 





200 205 210 
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50 



55 



GAC AGC CGT ACC CTC TGG GCC TCG GAG GAG GGC TGG CTG GTG TTT GAC 

Asp Ser Arg Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu Val Phe Asp 

215 220 225 

ATC ACA GCC ACC AGC AAC CAC TGG GTG GTC AAT CCG CGG CAC AAC CTG 

lie Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg His Asn Leu 

230 235 240 

GGC CTG CAG CTC TCG GTG GAG ACG CTG GAT GGG CAG AGC ATC AAC CCC 

Gly Leu Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser He Asn Pro 

245 250 255 

AAG TTG GCG GGC CTG ATT GGG CGG CAC GGG CCC CAG AAC AAG CAG CCC 

Lys Leu Ala Gly Leu lie Gly Arg His Gly Pro Gin Asn Lys Gin Pro 

260 265 270 275 

TTC ATG GTG GCT TTC TTC AAG GCC ACG GAG GTC CAC TTC CGC AGC ATC 

Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Phe Arg Ser lie 

280 285 290 

CGG TCC ACG GGG AGC AAA CAG CGC AGC CAG AAC CGC TCC AAG ACG CCC 

Arg Ser Thr Gly Ser Lys Gin Arg Ser Gin Asn Arg Ser Lys Thr Pro 

295 300 305 

AAG AAC CAG GAA GCC CTG CGG ATG GCC AAC GTG GCA GAG AAC AGC AGC 

Lys Asn Gin Glu Ala Leu Arg Met Ala Asn Val Ala Glu Asn Ser Ser 

310 315 320 

AGC GAC CAG AGG CAG GCC TGT AAG AAG CAC GAG CTG TAT GTC AGC TTC 

Ser Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe 

325 330 335 

CGA GAC CTG GGC TGG CAG GAC TGG ATC ATC GCG CCT GAA GGC TAC GCC 

Arg Asp Leu Gly Trp Gin Asp Trp lie lie Ala Pro Glu Gly Tyr Ala 

340 345 350 355 

GCC TAC TAC TGT GAG GGG GAG TGT GCC TTC CCT CTG AAC TCC TAC ATG 

Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met 

360 365 370 

AAC GCC ACC AAC CAC GCC ATC GTG CAG ACG CTG GTC CAC TTC ATC AAC 

Asn Ala Thr Asn His Ala lie Val Gin Thr Leu Val His Phe lie Asn 

375 380 385 

CCG GAA ACG GTG CCC AAG CCC TGC TGT GCG CCC ACG CAG CTC AAT GCC 

Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala 

390 ' 395 400 

ATC TCC GTC CTC TAC TTC GAT GAC AGC TCC AAC GTC ATC CTG AAG AAA 

lie Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val lie Leu Lys Lys 

405 410 415 

TAC AGA AAC ATG GTG GTC CGG GCC TGT GGC TGC CAC TAGCTCCTCC 

Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His 

420 425 430 

GAGAATTCAG ACCCTTTGGG GCCAAGTTTT TCTGGATCCT . CCATTGCTCG CCTTGGCCAG 
GAACCAGCAG ACCAACTGCC TTTTGTGAGA CCTTCCCCTC CCTATCCCCA ACTTTAAAGG 
TGTGAGAGTA TTAGGAAACA TGAGCAGCAT ATGGCTTTTG ATCAGTTTTT CAGTGGCAGC 



729 

777 

825 

873 

921 

969 

1017 

1065 

1113 

1161 

1209 

1257 

1305 

1351 

1411 

1471 

1531 
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20 



ATCCAATGAA CAAGATCCTA CAAGCTGTGC AGGCAAAACC TAGCAGGAAA AAAAAACAAC 
GCATAAAGAA AAATGGCCGG GCCAGGTCAT TGGCTGGGAA GTCTCAGCCA TGCACGGACT 
CGTTTCCAGA GGTAATTATG AGCGCCTACC AGCCAGGCCA CCCAGCCGTG GGAGGAAGGG 
GGCGTGGCAA GGGGTGGGCA CATTGGTGTC TGTGCGAAAG GAAAATTGAC CCGGAAGTTC 
CTGTAATAAA TGTCACAATA AAACGAATGA ATGAAAAAAA AAAAAAAAAA A 

(2) INFORMATION FOR SEQ ID NO:1 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 



25 



30 



35 



40 



45 



50 



55 



1591 

1651 

1711 

1771 

1822 
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30 



35 



Met His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala 

1 5 10 15 

Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser 

20 25 30 

Leu Asp Asn Glu Val His Ser Ser Phe lie His Arg Arg Leu Arg Ser 
35 40 45 

Gin Glu Arg Arg Glu Met Gin Arg Glu He Leu Ser lie Leu Gly Leu 
50 55 60 

Pro His Arg Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro 
65 70 75 80 

Met Phe Met Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Gly Gly 
85 90 95 

Gly Pro Gly Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser 
100 105 110 

Thr Gin Gly Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr 
115 120 125 

Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys 
130 135 140 

Glu Phe Phe His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu 
145 150 155 160 

Ser Lys He Pro Glu Gly Glu Ala Val Thr Ala Ala Glu Phe Arg lie 
165 170 175 

Tyr Lys Asp Tyr lie Arg Glu Arg phe Asp Asn Glu Thr Phe Arg lie 
180 185 190 

Ser Val Tyr Gin Val Leu Gin Glu His Leu Gly Arg Glu Ser Asp Leu 
195 200 205 



40 



45 



50 



55 



35 




EP 0 812 207 B1 



Phe 


Leu 

210 


Leu 


Asp 


Ser 


Arg 


Thr 

215 


Leu 


Trp 


Ala 


Ser 


Glu 

220 


Glu 


Gly Trp 


Leu 


Val 

225 


Phe 


Asp 


lie 


Thr 


Ala 

230 


Thr 


Ser 


Asn 


His 


Trp Val 
235 


Val 


Asn 


Pro Arg 
240 


His 


Asn 


Leu Gly 


Leu 

245 


Gin 


Leu 


Ser 


Val 


Glu 

250 


Thr 


Leu 


Asp Gly Gin 
255 


Ser 


He 


Asn 


Pro 


Lys 

260 


Leu 


Ala Gly Leu 


lie 

265 


Gly Arg 


His 


Gly Pro 
270 


Gin 


Asn 


Lys 


Gin 


Pro 

275 


Phe 


Met 


Val 


Ala 


Phe 

280 


Phe 


Lys 


Ala 


Thr 


Glu 

285 


Val 


His 


Phe 


Arg 


Ser 
2 90 


lie 


Arg 


Ser 


Thr 


Gly Ser Lys 
295 


Gin 


Arg 


Ser Gin 
300 


Asn 


Arg 


Ser 


Lys 

305 


Thr 


Pro 


Lys 


Asn 


Gin 

310 


Glu 


Ala 


Leu 


Arg 


Met 

315 


Ala 


Asn 


Val 


Ala 


Glu 

320 


Asn 


Ser 


Ser 


Ser 


Asp 

325 


Gin 


Arg 


Gin 


Ala 


Cys 

330 


Lys 


Lys 


His 


Glu 


Leu 

335 


Tyr 


Val 


Ser 


Phe 


Arg 

340 


Asp 


Leu Gly Trp 


Gin 

345 


Asp 


Trp 


lie 


lie 


Ala 

350 


Pro 


Glu 


Gly 


Tyr 


Ala 

355 


Ala 


Tyr 


Tyr 


Cys 


Glu Gly 
360 


Glu 


Cys 


Ala 


Phe 

365 


Pro 


Leu 


Asn 


Ser 


Tyr 

370 


Met 


Asn 


Ala 


Thr 


Asn 

375 


His 


Ala 


lie 


Val 


Gin 

380 


Thr 


Leu 


Val 


His 


Phe 

385 


lie 


Asn 


Pro 


Glu 


Thr 

390 


Val 


Pro 


Lys 


Pro 


Cys 

395 


Cys 


Ala 


Pro 


Thr 


Gin 

400 


Leu 


Asn 


Ala 


lie 


Ser 

405 


Val 


Leu Tyr 


Phe 


Asp 

410 


Asp 


Ser 


Ser 


Asn 


Val 

415 


lie 


Leu 


Lys 


Lys 


Tyr 

420 


Arg 


Asn 


Met 


Val 


Val 

425 


Arg 


Ala 


Cys 


Gly Cys 
430 


His 





(2) INFORMATION FOR SEQ ID NO:17: 
« (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1873 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

so (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

55 

(A) ORGANISM: MURIDAE 
(F) TISSUE TYPE: EMBRYO 



36 




(ix) FEATURE: 



(A) NAME/KEY: CDS 

(B) LOCATION: 104..1393 

(D) OTHER INFORMATION: /function= "OSTEOGENIC PROTEIN". 
/product= “MOP1 ” 

/note= "MOP1 (CDNA) n 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
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CTGCAGCAAG TGACCTCGGG TCGTGGACCG CTGCCCTGCC CCCTCCGCTG CCACCT GGGG 

CGGCGCGGGC CCGGTGCCCC. GGATCGCGCG TAGAGCGGGC GCG ATG CAC GTG CGC 

Met His Val Arg 
1 





TCG 


CTG 


CGC 


GCT 


GCG 


GCG 


CCA 


CAC 


10 


Ser 

5 


Leu 


Arg 


Ala 


Ala 


Ala 

10 


Pro 


His 




CTG 


TTC 


TTG 


CTG 


CGC 


TCC 


GCC 


CTG 




Leu 


Phe 


Leu 


Leu 


Arg 

25 


Ser 


Ala 


Leu 


15 


GTG 


CAC 


TCC 


AGC 


TTC 


ATC 


CAC 


CGG 




Val 


His 


Ser 


Ser 

40 


Phe 


lie 


His 


Arg 




GAG 


ATG 


CAG 


CGG 


GAG 


ATC 


CTG 


TCC 


20 


Glu 


Met 


Gin 

55 


Arg 


Glu 


lie 


Leu 


Ser 

60 




CGC 


CCG 


CAC 


CTC 


CAG 


GGA 


AAG 


CAT 


25 


Arg 


Pro 

70 


His 


Leu 


Gin 


Gly 


Lys 

75 


His 




GAC 


CTG 


TAC 


AAC 


GCC 


ATG 


GCG 


GTG 




Asp Leu 
85 


Tyr 


Asn 


Ala 


Met 

90 


Ala 


Val 


30 


GGC 


TTC 


TCC 


TAC 


CCC 


TAC 


AAG 


GCC 




Gly 


Phe 


Ser 


Tyr 


Pro 

105 


Tyr 


Lys 


Ala 




TTA 


GCC 


AGC 


CTG 


CAG 


GAC 


AGC 


CAT 


35 


Leu 


Ala 


Ser 


Leu 

120 


Gin 


Asp 


Ser 


His 




ATG 


AGC 


TTC 


GTC 


AAC 


CTA 


GTG 


GAA 


40 


Met 


Ser 


Phe 

135 


val 


Asn 


Leu 


Val 


Glu 

140 




CGA 


TAC 


CAC 


CAT 


CGG 


GAG 


TTC 


CGG 




Arg 


Tyr 

150 


His 


His 


Arg 


Glu 


Phe 

155 


Arg 


45 


GGC 


GAA 


CGG 


GTG 


ACC 


GCA 


GCC 


GAA 




Gly Glu 
165 


Arg 


Val 


Thr 


Ala 

170 


Ala 


Glu 




CGG 


GAG 


CGA 


TTT 


GAC 


AAC 


GAG 


ACC 


50 


Arg Glu 


Arg 


Phe 


Asp 

185 


Ash 


Glu 


Thr 




CTC 


CAG 


GAG 


CAC 


TCA 


GGC 


AGG 


GAG 


55 


Leu 


Gin 


Glu 


His 

200 


Ser 


Gly 


Arg 


Glu 



AGC TTC 
Ser Phe 



GCC GAT 
Ala Asp 
30 

CGC CTC 
Arg Leu 
45 

ATC TTA 
lie Leu 



AAT TCG 
Asn Ser 



GAG GAG 
Glu Glu 



GTC TTC 
Val Phe 
110 

TTC CTC 
Phe Leu 
125 

CAT GAC 
His Asp 



TTT GAT 
Phe Asp 



TTC AGG 
Phe Arg 



TTC CAG 
Phe Gin 
190 

TCG GAC 
Ser Asp 
205 



GTG GCG 
Val Ala 
15 

TTC AGC 
Phe Ser 



CGC AGC 
Arg Ser 



GGG TTG 
Gly Leu 



GCG CCC 
Ala Pro 
80 

AGC GGG 
Ser Gly 
95 

AGT ACC 
Ser Thr 



ACT GAC 
Thr Asp 



AAA GAA 
Lys Glu 



CTT TCC 
Leu Ser 
160 

ATC TAT 
He Tyr 
175 

ATC ACA 
lie Thr 



CTC TTC 
Leu Phe 



CTC TGG 
Leu Trp 



CTG GAC 
Leu Asp 



CAG GAG 
Gin Glu 
50 

CCC CAT 
Pro His 
65 

ATG TTC 
Met Phe 



CCG GAC 
Pro Asp 



CAG GGC 
Gin Gly 

GCC GAC 
Ala Asp 
130 

TTC TTC 
Phe Phe 
145 

AAG ATC 
Lys lie 



AAG GAC 
Lys Asp 



GTC TAT 
Val Tyr 



TTG CTG 
Leu Leu 
210 



GCG CCT 
Ala Pro 
20 

AAC GAG 
Asn Glu 
35 

CGG CGG 
Arg Arg 



CGC CCG 
Arg Pro 



ATG TTG 
Met Leu 



GGA CAG 
Gly Gin 
100 

CCC CCT 
Pro Pro 
115 

ATG GTC 
Met Val 



CAC CCT 
His Pro 



CCC GAG 
Pro Glu 



TAC ATC 
Tyr lie 
180 

CAG GTG 
Gin Val 
195 

GAC AGC 
Asp Ser 



60 

115 

163 

211 

259 

307 

355 

403 

451 

499 

547 

595 

643 

691 

739 
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CGC ACC ATC TGG GCT TCT GAG GAG GGC TGG TTG GTG TTT GAT ATC ACA 

Arg Thr He Trp Ala Ser Glu Glu Gly Trp Leu Val Phe Asp lie Thr 

215 220 225 

GCC ACC AGC AAC CAC TGG GTG GTC AAC CCT CGG CAC AAC CTG GGC TTA 

Ala Thr Ser Asn His Trp Val Val Asn Pro Arg His Asn Leu Gly Leu 

230 235 240 

CAG CTC TCT GTG GAG ACC CTG GAT GGG CAG AGC ATC AAC CCC AAG TTG 

Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser lie Asn Pro Lys Leu 

245 250 255 260 

GCA GGC CTG ATT GGA CGG CAT GGA CCC CAG AAC AAG CAA CCC TTC ATG. 

Ala Gly Leu lie Gly Arg His Gly Pro Gin Asn Lys Gin Pro Phe Met 

265 270 275 

GTG GCC TTC TTC AAG GCC ACG GAA GTC CAT CTC CGT AGT ATC CGG TCC 

Val Ala Phe Phe Lys Ala Thr Glu Val His Leu Arg Ser lie Arg Ser 

280 285 290 

ACG GGG GGC AAG CAG CGC AGC CAG AAT CGC TCC AAG ACG CCA AAG AAC 

Thr Gly Gly Lys Gin Arg Ser Gin Asn Arg Ser Lys Thr Pro Lys Asn 

295 300 305 

CAA GAG GCC CTG AGG ATG GCC AGT GTG GCA GAA AAC AGC AGC AGT GAC 

Gin Glu Ala Leu Arg Met Ala Ser Val Ala Glu Asn Ser Ser Ser Asp 

310 315 320 

CAG AGG CAG GCC TGC AAG AAA CAT GAG CTG TAC GTC AGC TTC CGA GAC 

Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp 

325 330 335 340 

CTT GGC TGG CAG GAC TGG ATC ATT GCA CCT GAA GGC TAT GCT GCC TAC 

Leu Gly Trp Gin Asp Trp lie lie Ala Pro Glu Gly Tyr Ala Ala Tyr 

345 350 355 

TAC TGT GAG GGA GAG TGC GCC TTC CCT CTG AAC TCC TAC ATG AAC GCC 

Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn Ala 

360 365 370 

ACC AAC CAC GCC ATC GTC CAG ACA CTG GTT CAC TTC ATC AAC CCA GAC 

Thr Asn His Ala lie Val Gin Thr Leu Val His Phe lie Asn Pro Asp 

375 380 385 

ACA GTA CCC AAG CCC TGC TGT GCG CCC ACC CAG CTC AAC GCC ATC TCT 

Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala lie Ser 

390 395 400 

GTC CTC TAC TTC GAC GAC AGC TCT AAT GTC ATC CTG AAG AAG TAC AGA 

Val Leu Tyr Phe Asp Asp Ser Ser Asn Val lie Leu Lys Lys Tyr Arg 

405 410 415 420 

AAC ATG GTG GTC CGG GCC TGT GGC TGC CAC TAGCTCTTCC TGAGACCCTG 
Asn Met Val Val Arg Ala Cys Gly Cys His 
425 430 

ACCTTTGCGG GGCCACACCT TTCCAAATCT TCGATGTCTC AC CAT CT AAG TCTCTCACTG 
CCCACCTTGG CGAGGAGAAC AGACCAACCT CTCCTGAGCC TTCCCTCACC TCCCAACCGG 
AAGCATGTAA GGGTTCCAGA AACCTGAGCG TGCAGCAGCT GATGAGCGCC CTTTCCTTCT 
GGCACGTGAC GGACAAGATC CTACCAGCTA CCACAGCAAA CGCCTAAGAG CAGGAAAAAT 



707 

835 

- 883 

931 

979 

1027 

1075 

1123 

1171 

1219 

1267 

1315 

1363 

1413 

1473 

1533 

1593 

1653 



39 




GTCTGCCAGG AAAGTGTCCA GTGTCCACAT GGCCCCTGGC GCTCTGAGTC TTTGAGGAGT 



1713 



AATCGCAAGC CTCGTTCAGC TGCAGCAGAA GGAAGGGCTT AGCCAGGGTG GGCGCTGGCG 
TCTGTGTTGA AGGGAAACCA AGCAGAA GCC ACTGTAATGA TATGTCACAA TAAAACCCAT 
GAATGAAAAA AAAAAAAAAA AAAAAAAAAA AAAAGAATTC 

(2) INFORMATION FOR SEQ ID N0:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 



1773 

1833 

1873 
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5 



10 



15 



20 



25 



30 



35 



40 



Met His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala 

1 5 10 15 

Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser 

20 25 30 

Leu Asp Asn Glu Val His Ser Ser Phe lie His Arg Arg Leu Arg Ser 
35 40 45 

Gin Glu Arg Arg Glu Met Gin Arg Glu lie Leu Ser lie Leu Gly Leu 
50 55 60 

Pro His Arg Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro 
65 70 75 80 

Met Phe Met Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Ser Gly 
85 90 95 

Pro Asp Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser Thr 
100 105 110 

Gin Gly Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr Asp 
115 120 125 

Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys Glu 
130 135 . 140 

Phe Phe His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser 
145 150 155 160 

Lys He Pro Glu Gly Glu Arg Val Thr Ala Ala Glu Phe Arg lie Tyr 
165 170 175 

Lys Asp Tyr lie Arg Glu Arg Phe Asp Asn Glu Thr Phe Gin He Thr 
180 185 190 

Val Tyr Gin Val Leu Gin Glu His Ser Gly Arg Glu Ser Asp Leu Phe 
195 200 205 

Leu Leu Asp Ser Arg Thr lie Trp Ala Ser Glu Glu Gly Trp Leu Val 



45 



50 



55 



41 
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210 215 220 





Phe 


Asp 


lie 


Thr 


Ala 


Thr 


Ser 


Asn His 


Trp Val 


Val 


Asn 


Pro Arg 


His 


5 


225 










230 










235 










240 




Asn 


Leu 


Gly Leu 


Gin 


Leu 


Ser 


Val 


Glu 


Thr 


Leu Asp Gly 


Gin 


Ser 


lie 












245 










250 










255 




10 


Asn 


Pro 


Lys 


Leu 


Ala 


Gly Leu 


He Gly 


Arg His Gly 


Pro 


Gin 


Asn 


Lys 










260 










265 










270 






Gin 


Pro 


Phe 


Met 


val 


Ala 


Phe 


Phe 


Lys 


Ala 


Thr 


Glu 


Val 


His 


Leu 


Arg 








275 










280 










285 






15 


Ser 


lie 


Arg 


Ser 


Thr 


Gly Gly Lys 


Gin 


Arg 


Ser 


Gin 


Asn Arg 


Ser 


Lys 






290 










295 










300 










Thr 


Pro 


Lys 


Asn 


Gin 


Glu 


Ala 


Leu 


Arg 


Met 


Ala 


Ser 


Val 


Ala 


Glu 


Asn 


20 


305 










310 










315 










320 




Ser 


Ser 


Ser 


Asp 


Gin 


Arg 


Gin 


Ala 


Cys 


Lys 


Lys 


His 


Glu 


Leu 


Tyr Val 












325 










330 










335 






Ser 


Phe 


Arg 


Asp 


Leu Gly Trp 


Gin Asp 


Trp 


lie 


He 


Ala 


Pro 


Glu Gly 


25 








340 










345 










350 








Tyr Ala 


Ala 


Tyr 


Tyr 


Cys 


Glu Gly 


Glu 


Cys 


Ala 


Phe 


Pro 


Leu 


Asn 


Ser 








355 










360 










365 








30 


Tyr 


Met 


Asn 


Ala 


Thr 


Asn 


His 


Ala 


lie 


Val 


Gin 


Thr 


Leu 


Val 


His 


Phe 




370 










375 










380 












He 


Asn 


Pro 


Asp 


Thr 


Val 


Pro 


Lys 


Pro 


Cys 


Cys 


Ala 


Pro 


Thr 


Gin 


Leu 




385 










390 










395 










400 


35 


Asn 


Ala 


lie 


Ser 


Val 


Leu Tyr 


Phe 


Asp Asp 


Ser 


Ser 


Asn 


Val 


lie 


Leu 












405 










410 










415 






Lys 


Lys 


Tyr 


Arg 


Asn 


Met 


Val 


Val 


Arg Ala 


Cys Gly Cys 


His 














420 










425 


* 








430 







(2) INFORMATION FOR SEQ ID NO:19: 

(i) SEQUENCE CHARACTERISTICS: 

45 

(A) LENGTH: 1723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 

(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 

55 (A) ORGANISM: HOMO SAPIENS 

(F) TISSUE TYPE: HIPPOCAMPUS 

(ix) FEATURE: 



42 




EP 0 812 207 B1 



(A) NAME/KEY: CDS 

(B) LOCATION: 490.. 1695 

(D) OTHER INFORMATION: /function= "OSTEOGENIC PROTEIN" 
/product= M hOP2-PP" 

5 /note= "hOP2 (cDNA) M 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1 9: 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 



43 
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15 



20 



25 



30 



35 



40 



45 



50 



GGCGCCGGCA GAGCAGGAGT GGCTGGAGGA GCTGTGGTTG GAGCAGGAGG TGGCACGGCA 

GGGCTGGAGG GCTCCCTATG AGTGGCGGAG ACGGCCCAGG AGGCGCTGGA GCAACAGCTC 

CCACACCGCA CCAAGCGGTG GCTGCAGGAG CTCGCCCATC GCCCCTGCGC TGCTCGGACC 

GCGGCCACAG CCGGACTGGC GGGTACGGCG GCGACAGAGG CATTGGCCGA GAGTCCCAGT 

CCGCAGAGTA GCCCCGGCCT CGAGGCGGTG GCGTCCCGGT CCTCTCCGTC CAGGAGCCAG 

GACAGGTGTC GCGCGGCGGG GCTCCAGGGA CCGCGCCTGA GGCCGGCTGC CCGCCCGTCC 

CGCCCCGCCC CGCCGCCCGC CGCCCGCCGA GCCCAGCCTC CTTGCCGTCG GGGCGTCCCC 

AGGCCCTGGG TCGGCCGCGG AGCCGATGCG CGCCCGCTGA GCGCCCCAGC TGAGCGCCCC 

CGGCCTGCC ATG ACC GCG CTC CCC GGC CCG CTC TGG CTC CTG GGC CTG 
Met Thr Ala Leu Pro Gly Pro Leu Trp Leu Leu Gly Leu 
15 10 

GCG CTA TGC GCG CTG GGC GGG GGC GGC CCC GGC CTG CGA CCC CCG CCC 

Ala Leu Cys Ala Leu Gly Gly Gly Gly Pro Gly Leu Arg Pro Pro Pro 

15 20 25 

GGC TGT CCC CAG CGA CGT CTG GGC GCG CGC GAG CGC CGG GAC GTG CAG 

Gly Cys Pro Gin Arg Arg Leu Gly Ala Arg Glu Arg Arg Asp Val Gin 

30 35 40 45 

CGC GAG ATC CTG GCG GTG CTC GGG CTG CCT GGG CGG CCC CGG CCC CGC 

Arg Glu lie Leu Ala Val Leu Gly Leu Pro Gly Arg Pro Arg Pro Arg 

50 55 60 

GCG CCA CCC GCC GCC TCC CGG CTG CCC GCG TCC GCG CCG CTC TTC ATG 

Ala Pro Pro Ala Ala Ser Arg Leu Pro Ala Ser Ala Pro Leu Phe Met 

65 70 75 

CTG GAC CTG TAC CAC GCC ATG GCC GGC GAC GAC GAC GAG GAC GGC GCG 

Leu Asp Leu Tyr His Ala Met Ala Gly Asp Asp Asp Glu Asp Gly Ala 

80 85 90 

CCC GCG GAG CGG CGC CTG GGC CGC GCC GAC CTG GTC ATG AGC TTC GTT 

Pro Ala Glu Arg Arg Leu Gly Arg Ala Asp Leu Val Met Ser Phe Val 

95 100 105 

AAC ATG GTG GAG CGA GAC CGT GCC CTG GGC CAC CAG GAG CCC CAT TGG 

Asn Met Val Glu Arg Asp Arg Ala Leu Gly His Gin Glu Pro His Trp 

110 115 120 125 

AAG GAG TTC CGC TTT GAC CTG ACC CAG ATC CCG GCT GGG GAG GCG GTC 

Lys Glu Phe Arg Phe Asp Leu Thr Gin lie Pro Ala Gly Glu Ala Val 

130 135 140 

ACA GCT GCG GAG TTC CGG ATT TAC AAG GTG CCC AGC ATC CAC CTG CTC 

Thr Ala Ala Glu Phe Arg lie Tyr Lys Val Pro Ser lie His Leu Leu 

145 150 155 

AAC AGG ACC CTC CAC GTC AGC ATG TTC CAG GTG GTC CAG GAG CAG TCC 

Asn Arg Thr Leu His Val Ser Met Phe Gin Val Val Gin Glu Gin Ser 

160 165 170 



60 

120 

180. 

240 

300 

360 

420 

480 

528 

576 

624 

672 

720 

768 

816 

864 

912 

960 

1008 



44 
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40 



45 



50 



55 



AAC AGG GAG TCT GAC TTG TTC TTT TTG GAT CTT CAG ACG CTC CGA GCT 1056 

Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp Leu Gin Thr Leu Arg Ala 

175 180 185 

GGA GAC GAG GGC TGG CTG GTG CTG GAT GTC ACA GCA GCC AGT GAC TGC 1104 

Gly Asp Glu Gly Trp Leu Val Leu Asp Val Thr Ala Ala Ser Asp Cys 

190 195 200 205 

TGG TTG CTG AAG CGT CAC AAG GAC CTG GGA CTC CGC CTC TAT GTG GAG 1152 

Trp Leu Leu Lys Arg His Lys Asp Leu Gly Leu Arg Leu Tyr Val Glu 

210 215 220 

ACT GAG GAC GGG CAC AGC GTG GAT CCT GGC CTG GCC GGC CTG CTG GGT 1200 

Thr Glu Asp Gly His Ser Val Asp Pro Gly Leu Ala Gly Leu Leu Gly 

225 230 235 

CAA CGG GCC CCA CGC TCC CAA CAG CCT TTC GTG GTC ACT TTC TTC AGG. 124 8 

Gin Arg Ala Pro Arg Ser Gin Gin Pro Phe Val Val Thr Phe Phe Arg 

240 245 250 

GCC AGT CCG AGT CCC ATC CGC ACC CCT CGG GCA GTG AGG CCA CTG AGG 1296 

Ala Ser Pro Ser Pro lie Arg Thr Pro Arg Ala Val Arg Pro Leu Arg 

255 260 265 

AGG AGG CAG CCG AAG AAA AGC AAC GAG CTG CCG CAG GCC AAC CGA CTC 1344 

Arg Arg Gin Pro Lys Lys Ser Asn Glu Leu Pro Gin Ala Asn Arg Leu 

270 275 280 285 

CCA GGG ATC TTT GAT GAC GTC CAC GGC TCC CAC GGC CGG CAG GTC TGC 13 92 

Pro Gly lie Phe Asp Asp Val His Gly Ser His Gly Arg Gin Val Cys 

290 295 300 

CGT CGG CAC GAG CTC TAC GTC AGC TTC CAG GAC CTC GGC TGG CTG GAC 144 0 

Arg Arg His Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Leu Asp 

305 310 315 

TGG GTC ATC GCT CCC CAA GGC TAC TCG GCC TAT TAC TGT GAG GGG GAG 14 88 

Trp Val lie Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly Glu 

320 325 330 

TGC TCC TTC CCA CTG GAC TCC TGC ATG AAT GCC ACC AAC CAC GCC ATC 1536 

Cys Ser Phe Pro Leu Asp Ser Cys Met Asn Ala Thr Asn His Ala lie 

335 340 345 

CTG CAG TCC CTG GTG CAC CTG ATG AAG CCA AAC GCA GTC CCC AAG GCG 1584 

Leu Gin Ser Leu Val His Leu Met Lys Pro Asn Ala Val Pro Lys Ala 

350 355 360 365 

TGC TGT GCA CCC ACC AAG CTG AGC GCC ACC TCT GTG CTC TAC TAT GAC 1632 

Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr Ser Val Leu Tyr Tyr Asp 

370 375 380 

AGC AGC AAC AAC GTC ATC CTG CGC AAA CAC CGC AAC ATG GTG GTC AAG 16 80 

Ser Ser Asn Asn Val lie Leu Arg Lys His Arg Asn Met Val Val Lys 

385 390 395 

GCC TGC GGC TGC CAC TGAGTCAGCC CGCCCAGCCC TACTGCAG 1723 

Ala Cys Gly Cys His 
400 



45 




(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
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45 



50 



Met Thr Ala Leu Pro Gly Pro Leu 
1 5 

Ala Leu Gly Gly Gly Gly Pro Gly 
20 

Gin Arg Arg Leu Gly Ala Arg Glu 
35 40 

Leu Ala Val Leu Gly Leu Pro Gly 
50 55 

Ala Ala Ser Arg Leu Pro Ala Ser 
65 70 

Tyr His Ala Met Ala Gly Asp Asp 
85 

Arg Arg Leu Gly Arg Ala Asp Leu 
100 

Glu Arg Asp Arg Ala Leu Gly His 
115 120 

Arg Phe Asp Leu Thr Gin lie Pro 
130 135 

Glu Phe Arg lie Tyr Lys Val Pro 
145 150 

Leu His Val Ser Met Phe Gin Val 
165 

Ser Asp Leu Phe Phe Leu Asp Leu 
180 

Gly Trp Leu Val Leu Asp Val Thr 
195 200 

Lys Arg His Lys Asp Leu Gly Leu 
210 215 

Gly His Ser Val Asp Pro Gly Leu 
225 230 

Pro Arg Ser Gin Gin Pro Phe Val 
245 



Trp Leu Leu Gly Leu Ala Leu Cys 
10 15 

Leu Arg Pro Pro Pro Giy Cys Pro 
25 30 

Arg Arg Asp val Gin Arg Glu He 
45 

Arg Pro Arg Pro Arg Ala Pro Pro 
60 

Ala Pro Leu Phe Met Leu Asp Leu 
75 80 

Asp Glu Asp Gly Ala Pro Ala Glu 
90 95 

Val Met Ser Phe Val Asn Met Val 
105 110 

Gin Glu Pro His Trp Lys Glu Phe 
125 

Ala Gly Glu Ala Val Thr Ala Ala 
140 

Ser lie His Leu Leu Asn Arg Thr 
155 160 

Val Gin Glu Gin Ser Asn Arg Glu 
170 175 

Gin Thr Leu Arg Ala Gly Asp Glu 
185 190 

Ala Ala Ser Asp Cys Trp Leu Leu 
205 

Arg Leu Tyr Val Glu Thr Glu Asp 
220 

Ala Gly Leu Leu Gly Gin Arg Ala 
235 240 

Val Thr Phe Phe Arg Ala Ser Pro 
250 255 



Ser Pro lie Arg Thr Pro Arg Ala Val Arg Pro Leu Arg Arg Arg Gin 
260 265 270 



55 



47 
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Pro Lys Lys Ser Asn Glu Leu Pro Gin Ala Asn Arg Leu Pro Gly lie 
275 280 285 

Phe Asp Asp Val His Gly Ser His’ Gly Arg Gin Val Cys Arg Arg His 
290 295 300 



Glu Leu Tyr Val 
305 

Ala Pro Gin Gly 



Pro Leu Asp Ser 
340 

Leu Val His Leu 
355 

Pro Thr Lys Leu 
370 

Asn Val lie Leu 
385 

Cys His 



(2) INFORMATION FOR SEQ ID 



Ser Phe Gin Asp Leu Gly 
310 

Tyr Ser Ala Tyr Tyr Cys 
325 330 

Cys Met Asn Ala Thr Asn 
345 

Met Lys Pro Asn Ala Val 
360 

Ser Ala Thr Ser Val Leu 
375 

Arg Lys His Arg Asn Met 
390 



>: 21 : 



Trp Leu Asp Trp Val He 
315 320 

Glu Gly Glu Cys Ser Phe 
335 

His Ala lie Leu Gin Ser 
350 

Pro Lys Ala Cys Cys Ala 
365 

Tyr Tyr Asp Ser Ser Asn 
380 

Val Val Lys Ala Cys Gly 
395 400 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 1 926 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: MURIDAE 
(F) TISSUE TYPE: EMBRYO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 93.. 1289 

(D) OTHER INFORMATION: /function^ "OSTEOGENIC PROTEIN" 
/product= "mOP2-PP" 

/note= "mOP2 cDNA" 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 



55 



48 
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10 



GCCAGGCACA GGTGCGCCGT CTGGTCCTCC CCGTCTGGCG TCAGCCGAGC CCGACCAGCT 6 0 

ACCAGTGGAT GCGCGCC GGC TGAAAGTCCG AG ATG GCT ATG CGT CCC GGG CCA 113 

Met Ala Met Arg Pro Gly Pro 
1 5 

CTC TGG CTA TTG GGC CTT GCT CTG TGC GCG CTG GGA GGC GGC CAC GGT . 161 

Leu Trp Leu Leu Gly Leu Ala Leu Cys Ala Leu Gly Gly Gly His Gly 

10 15 20 

CCG CGT CCC CCG CAC ACC TGT CCC CAG CGT CGC CTG GGA GCG CGC GAG 2 09 



15 



20 



25 



30 



35 



40 



45 



50 



55 



49 
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Pro 


Arg 


Pro 


Pro 


His 


Thr 






25 












CGC 


CGC 


GAC 


ATG 


CAG 


CGT 


5 


Arg 


Arg 


Asp 


Met 


Gin 


Arg 




40 










45 




CGG 


CCC 


CGA 


CCC 


CGT 


GCA 




Arg 


Pro 


Arg 


Pro 


Arg Ala 


10 










60 






GCG 


CCC 


CTC 


TTC 


ATG 


TTG 




Ala 


Pro 


Leu 


Phe 


Met 


Leu 










75 






15 


GAC 


GGC 


GGG 


CCA 


CCA 


CAG 




Asp Gly Gly 


Pro 


Pro 


Gin 








90 










AGC 


TTC 


GTC 


AAC 


ATG 


GTG 


20 


Ser 


Phe 

105 


Val 


Asn 


Met 


Val 




CCA 


CAC 


TGG 


AAG 


GAA 


TTC 




Pro 


His 


Trp 


Lys 


Glu 


Phe 




120 










125 


25 


GAG 


GCT 


GTC 


ACA 


GCT 


GCT 




Glu 


Ala 


Val 


Thr 


Ala 


Ala 












140 




30 


CAC 


CCG 


CTC 


AAC 


ACA 


ACC 


His 


Pro 


Leu 


Asn 


Thr 


Thr 










155 








GAG 


CAC 


TCC 


AAC 


AGG 


GAG 




Glu 


His 


Ser 


Asn 


Arg 


Glu 


35 






170 










CTC 


CGA 


TCT 


GGG 


GAC 


GAG 




Leu 


Arg 


Ser 


Gly Asp Glu 






185 










40 


AGT 


GAC 


CGA 


TGG 


CTG 


CTG 




Ser 


Asp 


Arg 


Trp 


Leu 


Leu 




200 










205 




TAT 


GTG 


GAA 


ACC 


GCG 


GAT 


45 


Tyr 


Val 


Glu 


Thr 


Ala 


Asp 










220 






CTG 


CTT 


GGA 


CGA 


CAA 


GCA 




Leu 


Leu Gly 


Arg 


Gin 


Ala 










235 






50 


TTC 


TTC 


AGG 


GCC 


AGC 


CAG 




Phe 


Phe 


Arg 


Ala 


Ser 


Gin 








250 










CCA 


CTG 


AAG 


AGG 


AGG 


CAG 


55 


Pro 


Leu 


Lys 


Arg Arg 


Gin 



265 



Cys Pro Gin Arg Arg Leu Gly Ala Arg Glu 
30 35 

GAA ATC CTG GCG GTG CTC GGG CTA CCG GGA 

Glu lie Leu Ala Val Leu Gly Leu Pro Gly 

50 55 

CAA CCC GCC. GCT GCC CGG CAG CCA GCG TCC 

Gin Pro Ala Ala Ala Arg Gin Pro Ala Ser 

65 70 

GAC CTA TAC CAC GCC ATG ACC GAT GAC GAC 

Asp Leu Tyr His Ala Met Thr Asp Asp Asp 

80 85 

GCT CAC TTA GGC CGT GCC GAC CTG GTC ATG 

Ala His Leu Gly Arg Ala Asp Leu Val Met 

95 100 

GAA CGC GAC CGT ACC CTG GGC TAC CAG GAG 

Glu Arg Asp Arg Thr Leu Gly Tyr Gin Glu 

110 115 

CAC TTT GAC CTA ACC CAG ATC CCT GCT GGG 

His Phe Asp Leu Thr Gin lie Pro Ala Gly 

130 135 

GAG TTC CGG ATC TAC AAA GAA CCC AGC ACC 

Glu Phe Arg lie Tyr Lys Glu Pro Ser Thr 

145 150 

CTC CAC ATC AGC ATG TTC GAA GTG GTC CAA 

Leu His He Ser Met Phe Glu Val Val Gin 

160 165 

TCT GAC TTG TTC TTT TTG GAT CTT CAG ACG 

Ser Asp Leu Phe Phe Leu Asp Leu Gin Thr 

175 180 

GGC TGG CTG GTG CTG GAC ATC ACA GCA GCC 

Gly Trp Leu Val Leu Asp lie Thr Ala Ala 

190 195 

AAC CAT CAC AAG GAC CTG GGA CTC CGC CTC 

Asn His His Lys Asp Leu Gly Leu Arg Leu 

210 215 

GGG CAC AGC ATG GAT CCT GGC CTG GCT GGT 

Gly His Ser Met Asp Pro Gly Leu Ala Gly 

225 230 

CCA CGC TCC AGA CAG CCT TTC ATG GTA ACC 

Pro Arg Ser Arg Gin Pro Phe Met Val Thr 

240 245 

AGT CCT GTG CGG GCC CCT CGG GCA GCG AGA 

Ser Pro Val Arg Ala Pro Arg Ala Ala Arg 

255 260 

CCA AAG AAA ACG AAC GAG CTT CCG CAC CCC 

Pro Lys Lys Thr Asn Glu Leu Pro His Pro 

270 275 



257 



305 



353 



401 



449 



497 



545 



593 



641 



689 



737 



76 5 



83 3 



881 



929 



50 
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AAC 


AAA 


CTC 


CCA 


GGG 


ATC 


TTT 


GAT 


GAT 


GGC 


CAC 


GGT 


TCC CGC 


GGC 


AGA 


977 


A sn 
280 


Lys 


Leu 


Pro 


Gly 


He 

285 


Phe 


Asp Asp 


Gly 


His 

290 


Gly ser Arg 


Gly 


Arg 

295 




GAG 


GTT 


TGC 


CGC 


AGG 


CAT 


GAG 


CTC 


TAC 


GTC 


AGC 


TTC 


CGT GAC 


CTT 


GGC 


1025 


Glu 


Val 


Cys 


Arg 


Arg 

300 


His 


Glu 


Leu Tyr 


Val 

305 


Ser 


Phe 


Arg Asp 


Leu 

310 


Gly 


TGG 


CTG 


GAC 


TGG 


GTC 


ATC 


GCC 


CCC 


CAG 


GGC 


TAC 


TCT 


GCC TAT 


TAC 


TGT 


1073 


Trp 


Leu 


Asp 


Trp 

315 


Val 


lie 


Ala 


Pro 


Gin 

320 


Gly 


Tyr 


Ser 


Ala Tyr 
325 


Tyr 


Cys 




GAG 


GGG 


GAG 


TGT 


GCT 


TTC 


CCA 


CTG 


GAC 


TCC 


TGT 


ATG 


AAC GCC 


ACC 


AAC 


1121 


Glu Gly 


Glu 

330 


Cys 


Ala 


Phe 


Pro 


Leu 

335 


Asp 


Ser 


Cys 


Met 


Asn Ala 
340 


Thr 


Asn 




CAT 


GCC 


ATC 


TTG 


CAG 


TCT 


CTG 


GTG 


CAC 


CTG 


ATG 


AAG 


CCA GAT 


GTT 


GTC 


1169 


His 


Ala 

345 


He 


Leu 


Gin 


Ser 


Leu 

350 


Val 


His 


Leu 


Met 


Lys 

355 


Pro Asp 


Val 


Val 




CCC 


AAG 


GCA 


TGC 


TGT 


GCA 


CCC 


ACC 


AAA 


CTG 


AGT 


GCC 


ACC TCT 


GTG 


CTG 


1217 


Pro 

360 


Lys 


Ala 


Cys 


Cys 


Ala 

365 


Pro 


Thr 


Lys 


Leu 


Ser 

370 


Ala 


Thr Ser 


Val 


Leu 

375 




TAG 


TAT 


GAC 


AGC 


AGC 


AAC 


AAT 


GTC 


ATC 


CTG 


CGT 


AAA 


CAC CGT 


AAC 


ATG 


1265 


Tyr Tyr 


Asp 


Ser 


Ser 

380 


Asn 


Asn 


Val 


lie 


Leu 

385 


Arg 


Lys 


His Arg 


Asn 

390 


Met 




GTG 

Val 


GTC 

Val 


AAG 

Lys 


GCC 

Ala 

395 


TGT 

Cys 


GGC 

Gly 


TGC 

Cys 


CAC 

His 


TGAGGCCCCG CCCAGCATCC TGCTTCTACT 


1319 



ACCTTACCAT CTGGCCGGGC CCCTCTCCAG AGGCAGAAAC CCTTCTATGT TATCATAGCT 1379 
CAGACAGGGG CAATGGGAGG CCCTTCACTT CCCCTGGCCA CTTCCTGCTA AAATTCTGGT 143 9 
CTTTCCCAGT TCCTGTGTCC TTCATGGGGT TTCGGGGCTA TCACCCCGCC CTCTCCATCC 1499 
TCCTACCCCA AGCATAGACT GAATGCACAC AGCATCCCAG AGCTATGCTA ACTGAGAGGT 1559 
CTGGGGTCAG CACTGAAGGC CCACATGAGG AAGACTGATC CTTGGCCATC CTCAGCCCAC 1619 
AATGGCAAAT TCTGGATGGT CTAAGAAGGC CCTGGAATTC TAAACTAGAT GATCTGGGCT 167 9 
CTCTGCACCA TTCATTGTGG CAGTTGGGAC ATTTTTAGGT ATAACAGACA CATACACTTA 1739 
GATCAATGCA TCGCTGTACT CCTTGAAATC AGAGCTAGCT TGTTAGAAAA AGAATCAGAG 1799 
CCAGGTATAG CGGTGCATGT CATTAAT CCC AGCGCTAAAG AGACAGAGAC AGGAGAATCT 185 9 
CTGTGAGTTC AAGGCCACAT AGAAAGAGCC TGTCTCGGGA GCAGGAAAAA AAAAAAAAAC 1919 
GGAATTC 1926 



55 



(2) INFORMATION FOR SEQ ID NO:22: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 399 amino acids 



51 




(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
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5 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



Met Ala Met Arg Pro Gly Pro Leu 
1 5 

Ala Leu Gly Gly Gly His Gly Pro 
20 

Arg Arg Leu Gly Ala Arg Glu Arg 
35 40 

Ala Val Leu Gly Leu Pro Gly Arg 
50 55 

Ala Ala Arg Gin Pro Ala Ser Ala 
65 70 

His Ala Met Thr Asp Asp Asp Asp 
85 



Trp Leu Leu Gly Leu Ala Leu Cys 
10 15 

Arg Pro Pro His Thr Cys Pro Gin 
25 30 

Arg Asp Met Gin Arg Glu lie Leu 
45 

Pro Arg Pro Arg Ala Gin Pro Ala 
60 

Pro Leu Phe Met Leu Asp Leu Tyr 
75 . 80 

Gly Gly Pro Pro Gin Ala His Leu 
90 95 



Gly Arg Ala Asp Leu Val Met Ser Phe Val Asn Met Val Glu Arg Asp 
100 105 110 

Arg Thr Leu Gly Tyr Gin Glu Pro His Trp Lys Glu Phe His Phe Asp 
115 120 125 

Leu Thr Gin lie Pro Ala Gly Glu Ala Val Thr Ala Ala Glu Phe Arg 
130 135 140 

He Tyr Lys Glu Pro Ser Thr His Pro Leu Asn Thr Thr Leu His lie 

145 150 155 160 

Ser Met Phe Glu Val Val Gin Glu His Ser Asn Arg Glu Ser Asp Leu 

165 170 175 

Phe Phe Leu Asp Leu Gin Thr Leu Arg Ser Gly Asp Glu Giy Trp Leu 
180 185 190 

Val Leu Asp lie Thr Ala Ala Ser Asp Arg Trp Leu Leu Asn His His 
195 200 205 

Lys Asp Leu Gly Leu Arg Leu Tyr Val Glu Thr Ala Asp Gly His Ser 
210 215 220 

Met Asp Pro Gly Leu Ala Gly Leu Leu Gly Arg Gin Ala Pro Arg Ser 

225 230 235 240 

Arg Gin Pro Phe Met Val Thr Phe Phe Arg Ala Ser Gin Ser Pro Val 

245 250 255 

Arg Ala Pro Arg Ala Ala Arg Pro Leu Lys Arg Arg Gin Pro Lys Lys 
260 265 270 

Thr Asn Glu Leu Pro His Pro Asn Lys Leu Pro Gly lie Phe Asp Asp 
275 280 285 

Gly His Gly Ser Arg Gly Arg Glu Val Cys Arg Arg His Glu Leu Tyr 
290 295 300 

Val Ser Phe Arg Asp Leu Gly Trp Leu Asp Trp Val lie Ala Pro Gin 



53 
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305 



310 



315 



10 



15 



320 



Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asp 
325 330 335 

Ser Cys Met Asn Ala Thr Asn His Ala lie Leu Gin Ser Leu Val His 
340 345 350 

Leu Met Lys Pro Asp Val Val Pro Lys Ala Cys Cys Ala Pro Thr Lys 
355 360 365 

Leu Ser Ala Thr Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val lie 
370 375 380 

Leu Arg Lys His Arg Asn Met Val Val Lys Ala Cys Gly Cys His 
385 390 395 



20 



(2) INFORMATION FOR SEQ ID NO:23: 
(i) SEQUENCE CHARACTERISTICS: 



25 



(A) LENGTH: 1368 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 



35 



(A) NAME/KEY: CDS 

(B) LOCATION: 1.. 1365 

(D) OTHER INFORMATION: /labe!= 60A 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



40 



45 



50 



55 



54 
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10 



15 




25 



ATG TCG GGA CTG CGA AAC ACC TCG 

Met Ser Gly Leu Arg Asn Thr Ser 

1 5 

CTG GGA CTC GGA ATG GTT CTG CTC 

Leu Gly Leu Gly Met Val Leu Leu 

20 

GCC GTT GAG GCC ACC CAG TCG GGG 

Ala Val Glu Ala Thr Gin Ser Gly 

35 40 

CAG ACG ATC ATG CAC AGA GTG CTG 

Gin Thr lie Met His Arg Val Leu 

50 55 

TCG TAC GAG ATC CTC GAG TTC CTG 

Ser Tyr Glu lie Leu Glu Phe Leu 

65 70 

CTG AGC AGC CAC CAG TTG TCG CTG 

Leu Ser Ser His Gin Leu Ser Leu 

85 

CTG GAC GTC TAC CAC CGC ATC ACG 



GAG GCC GTT GCA GTG CTC GCC TCC 

Glu Ala Val Ala Val Leu Ala Ser 

10 15 

ATG TTC GTG GCG ACC ACG CCG CCG 

Met Phe Val Ala Thr Thr Pro Pro 

25 30 

ATT TAC ATA GAC AAC GGC AAG GAC 

lie Tyr lie Asp Asn Gly Lys Asp 

45 

AGC GAG GAC GAC AAG CTG GAC GTC 

Ser Glu Asp Asp Lys Leu Asp Val 

60 

GGC ATC GCC GAA CGG CCG ACG CAC 

Gly He Ala Glu Arg Pro Thr His 

75 80 

AGG AAG TCG GCT CCC AAG TTC CTG 

Arg Lys Ser Ala Pro Lys Phe Leu 

90 95 

GCG GAG GAG GGT CTC AGC GAT CAG 



48 



96 



144 



192 



, 240 



268 



3 36 



30 



35 




40 



45 



50 



55 



55 
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50 



Leu Asp Val Tyr 
100 

GAT GAG GAC GAC 
Asp Glu Asp Asp 
115 

GAC CTC GAG GAG 
Asp Leu Glu Glu 
130 

CTG GAC AAG CGG 
Leu Asp Lys Arg 
145 

AAC AAG CGC CAC 
Asn Lys Arg His 



CGC CTG TGG TTC 
Arg Leu Trp Phe 
180 

ATG GCC GAG CTG 
Met Ala Glu Leu 
195 

ACC GCC AAC AGG 
Thr Ala Asn Arg 
210 

ACG CTG GGC CAG 
Thr Leu Gly Gin 
225 

GGG GAC TAC GTG 
Gly Asp Tyr Val 



GAG TGG CTG GTC 
Glu Trp Leu Val 
260 

CAC GCT GTC AAC 
His Ala Val Asn 
275 

CTG ATC CAC CGC 
Leu lie His Arg 
290 

TTC TTC CGC GGA 
Phe Phe Arg Gly 
305 

CAC AGG AGC AAG 
His Arg Ser Lys 



GTG TCG CCC AAC 
Val Ser Pro Asn 
340 



His Arg lie Thr 



GAC TAC GAA CGC 
Asp Tyr Glu Arg 
120 

GAT GAG GGC GAG 
Asp Glu Gly Glu 
135 

GCC ATC GAC GAG 
Ala lie Asp Glu 
150 

CAC AAT GTG GAC 
His Asn Val Asp 
165 

GAC GTC TCC AAC 
Asp Val Ser Asn 



CGC ATC TAT CAG 
Arg lie Tyr Gin 
200 

GAG TTC ACC ATC 
Glu Phe Thr lie 
215 

CAC ACC ATG GAG 
His Thr Met Glu 
230 

GGC TGG TTG GAG 
Gly Trp Leu Glu 
245 

AAG TCG AAG GAC 
Lys Ser Lys Asp 



CGA CCC GAC CGC 
Arg Pro Asp Arg 
280 

AAG GTG GAC GAC 
Lys Val Asp Asp 
295 

CCG GAG CTG ATC 
Pro Glu Leu lie 
310 

CGA AGC GCC AGC 
Arg Ser Ala Ser 
325 

AAC GTG CCG CTG 
Asn Val Pro Leu 



Ala Glu Glu Gly 
105 

GGC CAT CGG TCC 
Gly His Arg Ser 

CAG CAG AAG AAC 
Gin Gin Lys Asn 
140 

AGC GAC ATC ATC 
Ser Asp lie lie 
155 

GAA CTG CGT CAC 
Glu Leu Arg His 
17 0 

GTG CCC AAC GAC 
Val Pro Asn Asp 
185 

AAC GCC AAC GAG 
Asn Ala Asn Glu 



ACG GTA TAC GCC 
Thr Val Tyr Ala 
220 

CCG CTG TCC TCG 
Pro Leu Ser Ser 
235 

CTC AAC GTG ACC 
Leu Asn Val Thr 
250 

AAT CAT GGC ATC 
Asn His Gly lie 
265 

GAG GTG AAG CTG 
Glu Val Lys Leu 



GAG TTC CAG CCC 
Glu Phe Gin Pro 
300 

AAG GCG ACG GCC 
Lys Ala Thr Ala 
315 

CAT CCA CGC AAG 
His Pro Arg Lys 
330 

CTG GAA CCG ATG 
Leu Glu Pro Met 
345 



Leu Ser Asp Gin 
110 

AGG AGG AGC GCC 
Arg Arg Ser- Ala 
125 

TTC ATC ACC GAC 
Phe lie Thr Asp 



ATG ACC TTC CTG 
Met Thr Phe Leu 
160 

GAG CAC GGC CGT 
G|lu His Gly Arg 
175 

AAC TAC CTG GTG 
Asn Tyr Leu Val 
190 

GGC AAG TGG CTG 
Gly Lys Trp Leu 
205 

ATT GGC ACC GGC 
lie Gly Thr Gly 



GTG AAC ACC ACC 
Val Asn Thr Thr 
240 

GAG GGC CTG CAC 
Glu Gly Leu His 
255 

TAC ATT GGA GCA 
Tyr lie Gly Ala 
270 

GAC GAC ATT GGA 
Asp Asp lie Gly 
285 

TTC ATG ATC GGC 
Phe Met lie Gly 



CAC AGC AGC CAC 
His Ser Ser His 
320 

CGC AAG AAG TCG 
Arg Lys Lys Ser 
335 

GAG AGC ACG CGC 
Glu Ser Thr Arg 
350 



384 



432 



460 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



56 
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AGC 


TGC 


CAG 


ATG 


CAG 


ACC 


CTG 


TAC 


ATA 


GAC 


TTC 


AAG 


GAT 


CTG 


GGC 


TGG 


1104 


Ser 


Cys 


Gin 


Met 


Gin 


Thr 


Leu Tyr 


lie 


Asp 


Phe 


Lys Asp 


Leu Gly Trp 








355 










360 










365 










CAT 


GAC 


TGG 


ATC 


ATC 


GCA 


CCA 


GAG 


GGC 


TAT 


GGC 


GCC 


TTC 


TAC 


TGC 


AGC 


1152 


His 


Asp Trp 


lie 


lie 


Ala 


Pro 


Glu Gly Tyr Gly Ala 


Phe 


Tyr Cys 


Ser 






370 










375 










360 












GGC 


GAG 


TGC 


AAT 


TTC 


CCG 


CTC 


AAT 


GCG 


CAC 


ATG 


AAC 


GCC 


ACG 


AAC 


CAT 


1200 


Gly 


Glu 


Cys 


Asn 


Phe 


Pro 


Leu 


Asn 


Ala 


His 


Met 


Asn 


Ala 


Thr 


Asn 


His 




385 










390 










395 










400 




GCG 


ATC 


GTC 


CAG 


ACC 


CTG 


GTC 


CAC 


CTG 


CTG 


GAG 


CCC 


AAG 


AAG 


GTG 


CCC 


1248 


Ala 


He 


Val 


Gin 


Thr 


Leu 


Val 


His 


Leu 


Leu 


Glu 


Pro Lys 


Lys 


Val 


Pro 












405 










410 










415 






AAG 


CCC 


TGC 


TGC 


GCT 


CCG 


ACC 


AGG 


CTG 


GGA 


GCA 


CTA 


CCC 


GTT 


CTG 


TAC 


1296 


Lys 


Pro 


Cys 


Cys 


Ala 


Pro 


Thr 


Arg 


Leu Gly Ala 


Leu 


Pro 


Val 


Leu 


Tyr 










420 










425 










430 






CAC 


CTG 


AAC 


GAC 


GAG 


AAT 


GTG 


AAC 


CTG 


AAA 


AAG 


TAT 


AGA 


AAC 


ATG 


ATT 


1344 


His 


Leu 


Asn 


Asp 


Glu 


Asn 


Val 


Asn 


Leu 


Lys 


Lys 


Tyr Arg Asn Met 


lie 








435 










440 










445 










GTG 


AAA 


TCC 


TGC 


GGG 


TGC 


CAT 


TGA 


















1368 


Val 


Lys 


Ser 


Cys 


Gly 


Cys 


His 























450 455 



(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 



57 
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20 



Met Ser Gly Leu Arg Asn Thr Ser Glu Ala Val Ala Val Leu Ala Ser 

1 5 10 15 

Leu Gly Leu Gly Met Val Leu Leu Met Phe Val Ala Thr Thr Pro Pro 

20 25 30 

Ala Val Glu Ala Thr Gin Ser Gly He Tyr lie Asp Asn Gly Lys Asp 
35 40 45 

Gin Thr lie Met His Arg Val Leu Ser Glu Asp Asp Lys Leu Asp Val 
50 55 60 

Ser Tyr Glu lie Leu Glu Phe Leu Gly lie Ala Glu Arg Pro Thr His 

65 70 75 80 

Leu Ser Ser His Gin Leu Ser Leu Arg Lys Ser Ala Pro Lys Phe Leu 

85 90 95 

Leu Asp Val Tyr His Arg lie Thr Ala Glu Glu Gly Leu Ser Asp Gin 

100 105 110 



25 



30 



35 



40 



45 



50 



55 



58 
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Asp Glu Asp Asp Asp Tyr Glu Arg Gly His Arg Ser Arg Arg Ser Ala 
115 120 125 

Asp Leu Glu Glu Asp Glu Gly Glu Gin Gin Lys Asn Phe lie Thr Asp 
130 135 140 

Leu Asp Lys Arg Ala lie Asp Glu Ser Asp He lie Met Thr Phe Leu 

145 150 155 160 

His Gly Arg 
175 

Tyr Leu Val 
190 

Met Ala Glu Leu Arg lie Tyr Gin Asn Ala Asn Glu Gly Lys Trp Leu 
195 200 205 

Thr Ala Asn Arg Glu Phe Thr lie Thr Val Tyr Ala lie Gly Thr Gly 
210 215 220 

Thr Leu Gly Gin His Thr Met Glu Pro Leu Ser Ser Val Asn Thr Thr 

225 230 235 240 

Gly Asp Tyr Val Gly Trp Leu Glu Leu Asn Val Thr Glu Gly Leu His 

245 250 255 

Glu Trp Leu Val Lys Ser Lys Asp Asn His Gly lie Tyr lie Gly Ala 
260 265 270 

His Ala Val Asn Arg Pro Asp Arg Glu Val Lys Leu Asp Asp lie Gly 
275 280 285 

Leu lie His Arg Lys Val Asp Asp Glu Phe Gin Pro Phe Met lie Gly 
290 295 300 

Phe Phe Arg Gly Pro Glu Leu lie Lys Ala Thr Ala His Ser Ser His 

305 310 315 320 

His Arg Ser Lys Arg Ser Ala Ser His Pro Arg Lys Arg Lys Lys Ser 

325 330 335 . 

Val Ser Pro Asn Asn Val Pro Leu Leu Glu Pro Met Glu Ser Thr Arg 
340 345 350 

Ser Cys Gin Met Gin Thr Leu Tyr lie Asp Phe Lys Asp Leu Gly Trp 
355 360 365 

His Asp Trp lie lie Ala Pro Glu Gly Tyr Gly Ala Phe Tyr Cys Ser 
370 375 380 

Gly Glu Cys Asn Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His 

385 390 395 400 

Ala lie Val Gin Thr Leu Val His Leu Leu Glu Pro Lys Lys Val Pro 

405 410 415 

Lys Pro Cys Cys Ala Pro Thr Arg Leu Gly Ala Leu Pro Val Leu Tyr 
420 425 430 

His Leu Asn Asp Glu Asn Val Asn Leu Lys Lys Tyr Arg Asn Met lie 
435 440 445 



Asn Lys Arg His His Asn Val Asp Glu Leu Arg His Glu 
165 170 

Arg Leu Trp Phe Asp Val Ser Asn Val Pro Asn Asp Asn 
180 185 



59 
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Val Lys Ser Cys Gly Cys His 
450 455 



(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1674 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 69..1265 

(D) OTHER INFORMATION: /note= n mOP3-PP" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 



25 



30 



35 ■ 



40 



45 



50 



55 



60 
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15 



20 



25 



30 



35 



40 



GGATGCGCG G CGCTGTCCCA TCCTTGTCGT CGAGGCGTCG CTGGATGCGA GTCCGCTAAA 

CGTCCGAG ATG GCT . GCG CGT CCG GGA CTC CTA TGG CTA CTG GGC CTG GCT 
Met Ala Ala Arg Pro Gly Leu Leu Trp Leu Leu Gly Leu Ala 
1 5 10 

CTG TGC GTG TTG GGC GGC GGT CAC CTC TCG CAT CCC CCG CAC GTC TTT 

Leu Cys Val Leu Gly Gly Gly His Leu Ser His Pro Pro His Val Phe 

15 20 25 30 

CCC CAG CGT CGA CTA GGA GTA CGC GAG CCC CGC GAC ATG CAG CGC GAG 

Pro Gin Arg Arg Leu Gly Val Arg Glu Pro Arg Asp Met Gin Arg Glu 

35 40 45 

ATT CGG GAG GTG CTG GGG CTA GCC GGG CGG CCC CGA TCC CGA GCA CCG 

lie Arg Glu Val Leu Gly Leu Ala Gly Arg Pro Arg Ser Arg Ala Pro 

50 55 60 

GTC GGG GCT GCC CAG CAG CCA GCG TCT GCG CCC CTC TTT ATG TTG GAC 

Val Gly Ala Ala Gin Gin Pro Ala Ser Ala Pro Leu Phe Met Leu Asp 

65 70 75 

CTG TAC CGT GCC ATG ACG GAT GAC AGT GGC GGT GGG ACC CCG CAG CCT 

Leu Tyr Arg Ala Met Thr Asp Asp Ser Gly Gly Gly Thr Pro Gin Pro 

80 85 90 

CAC TTG GAC CGT GCT GAC CTG ATT ATG AGC TTT GTC AAC ATA GTG GAA 

His Leu Asp Arg Ala Asp Leu He Met Ser Phe Val Asn lie Val Glu 

95 100 105 110 

CGC GAC CGT ACC CTG GGC TAC CAG GAG CCA CAC TGG AAG GAA TTC CAC 

Arg Asp Arg Thr Leu Gly Tyr Gin Glu Pro His Trp Lys Glu Phe His 

.115 120 125 

TTT GAC CTA ACC CAG ATC CCT GCT GGG GAG GCT GTC ACA GCT GCT GAG 

Phe Asp Leu Thr Gin lie Pro Ala Gly Glu Ala Val Thr Ala Ala Glu 

130 135 140 

TTC CGG ATC TAC AAA GAA CCC AGT ACC CAC CCG CTC AAC ACA ACC CTC 

Phe Arg lie Tyr Lys Glu Pro Ser Thr His Pro Leu Asn Thr Thr Leu 



60 

110 

158 

206 

254 

302 

350 

398 

446 

494 

542 




50 



55 



61 




EP 0 81 2 207 B1 



5 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



145 150 155 

CAC ATC AGC ATG TTC GAA GTG GTC CAA GAG CAC TCC AAC AGG GAG TCT 
His He Ser Met Phe Glu Val Val Gin Glu His Ser Asn Arg Glu Ser 
160. 165 170 



GAC 


TTG 


TTC 


TTT 


TTG 


GAT 


CTT 


CAG 


Asp 

175 


Leu 


Phe 


Phe 


Leu 


Asp 

180 


Leu 


Gin 


TGG 


CTG 


GTG 


CTG 


GAC 


ATC 


ACA 


GCA 


Trp 


Leu 


Val 


Leu Asp 
195 


lie 


Thr 


Ala 


CAT 


CAC 


AAG 


GAC 


CTA 


GGA 


CTC 


CGC 


His 


His 


Lys 


Asp 

210 


Leu 


Gly 


Leu. 


Arg 


CAC 


AGC 


ATA 


GAT 


CCT 


GGC 


CTA 


GCT 


His 


Ser 


lie 

225 


Asp 


Pro 


Gly 


Leu 


Ala 

230 


CGC 


TCC 


AGA 


CAG. 


CCT 


TTC 


ATG 


GTT 


Arg 


Ser Arg 
240 


Gin 


Pro 


Phe 


Met 

245 


Val 


CCT 


GTG 


CGG 


GCC 


CCT 


CGA 


ACA 


GCA 


Pro 

255 


Val 


Arg 


Ala 


Pro 


Arg 

260 


Thr 


Ala 


AAT 


CAA 


ATC 


AAC 


CAG 


CTG 


CCG 


CAC 


Asn 


Gin 


lie 


Asn 


Gin 

275 


Leu 


Pro 


His 


GAT 


GAT 


GGC 


CAC 


GGT 


TCT 


CAC 


GGC 


Asp Asp 


Gly 


His Gly 
290 


Ser 


His 


Gly 


CTC 


TAT 


GTC 


AGC 


TTC 


CGT 


GAC 


CTT 


Leu 


Tyr 


Val 

305 


Ser 


Phe 


Arg 


Asp 


Leu 

310 


CCC 


CAG 


GGC 


TAC 


TCC 


GCC 


TAT 


TAC 


Pro 


Gin 

320 


Gly 


Tyr 


Ser 


Ala 


Tyr 

325 


Tyr 


CTG 


AAC 


TCC 


TGT 


ATG 


AAC 


TCC 


ACC 


Leu 

335 


Asn 


Ser 


Cys 


Met 


Asn 

340 


Ser 


Thr 


GTA 


CAT 


CTG 


ATG 


AAG 


CCA 


GAT 


ATC 


Val 


His 


Leu 


Met 


Lys 

355 


Pro 


Asp 


lie 


ACT 


GAG 


CTG 


AGT 


GCC 


ATT 


TCT 


CTG 


Thr 


Glu 


Leu 


Ser 

370 


Ala 


lie 


Ser 


Leu 


GTC 


ATC 


CTG 


CGC 


AGG 


GAG 


CGC 


AAC 


Val 


lie 


Leu 

385 


Arg 


Arg 


Glu 


Arg 


Asn 

390 



ACG 


CTC 


CGA 


TCT 


GGG 


GAC 


GAG 


GGC 


Thr 


Leu 


Arg 


Ser 


Gly 


Asp 


Glu 


Gly 






185 










190 


GCC 


AGT 


GAC 


CGA 


TGG 


CTG 


CTG 


AAC 


Ala 


Ser 


Asp Arg 


Trp 


Leu 


Leu 


Asn 




200 










205 




CTC 


TAT 


GTG 


GAA 


ACC 


GAG 


GAT 


GGG 


Leu 


Tyr 


Val 


Glu 


Thr 


Glu 


Asp 


Gly 


215 










220 




GGT 


CTG 


CTT 


GGA 


CGA 


CAA 


GCA 


CCA 


Gly 


Leu 


Leu 


Gly 


Arg 


Gin 


Ala 


Pro 










235 








GGT 


TTC 


TTC 


AGG 


GCC 


AAC 


CAG 


AGT 


Gly 


Phe 


Phe 


Arg 


Ala 


Asn 


Gin 


Ser 








250 










AGA 


CCA 


CTG 


AAG 


AAG 


AAG 


CAG 


CTA 


Arg 


Pro 


Leu 


Lys 


Lys 


Lys 


Gin 


Leu 






265 










270 


TCC 


AAC 


AAA 


CAC 


CTA 


GGA 


ATC 


CTT 


Ser 


Asn 


Lys 


His 


Leu 


Gly 


lie 


Leu 




280 










285 




AGA 


GAA 


GTT 


TGC 


CGC 


AGG 


CAT 


GAG 


Arg 


Glu 


Val 


Cys 


Arg 


Arg 


His 


Glu 


295 










300 






GGC 


TGG 


CTG 


GAC 


TCT 


GTC 


ATT 


GCC 


Gly 


Trp 


Leu Asp 


Ser 


Val 


lie 


Ala 










315 








TGT 


GCT 


GGG 


GAG 


TGC 


ATC 


TAC 


CCA 


Cys 


Ala 


Gly Glu 


Cys 


lie 


Tyr 


Pro 








330 










AAC 


CAC 


GCC 


ACT 


ATG 


CAG 


GCC 


CTG 


Asn 


His 


Ala 


Thr 


Met 


Gin 


Ala 


Leu 






345 










350 


ATC 


CCC 


AAG 


GTG 


TGC 


TGT 


GTG 


CCT 


lie 


Pro 


Lys 


Val 


Cys 


Cys 


Val 


Pro 




360 










365 




CTC 


TAC 


TAT 


GAT 


AGA 


AAC 


AAT 


AAT 


Leu 


Tyr 


Tyr 


Asp 


Arg 


Asn 


Asn 


Asn 


375 










380 






ATG 


GTA 


GTC 


CAG 


GCC 


TGT 


GGC 


TGC 


Met 


Val 


Val 


Gin 


Ala 


Cys 


Gly 


Cys 










395 









CAC TGAGTCCCTG CCCAACAGCC TGCTGCCATC CCATCTATCT AGTCAGGCCT 



590 



638 



686 



734 



782 



830 



878 



926 



974 



1022 



1070 



1118 



1166 



1214 



1262 



1315 



62 




1375 



CTCTTCCAAG GCAGGAAACC AACAAAGAGG GAAGGCAGTG- CTTTCAACTC CATGTCCACA 
TTCACAGTCT TGGCCCTCTC TGTTCTTTTT GCCAAGGCTG AGAAGATGGT CCTAGTTATA 
ACCCTGGTGA CCTCAGTAGC CCGATCTCTC ATCTCCCCAA ACTCCCCAAT GCAGCCAGGG 
GCATCTATGT CCTTTGGGAT TGGGCACAGA AGTCCAATTT ACCAACTTAT TCATGAGTCA 
CTACTGGCCC AGCCTGGACT TGAACCTGGA ACACAGGGTA GAGCTCAGGC TCTTCAGTAT 
CCATCAGAAG ATTTAGGTGT GTGCAGACAT GACCACACTC CCCCTAGCAC TCCATAGCC 



1435 

1495 

1555 

1615 

1674 



(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
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5 



10 



15 



20 



25 



30 



35 



Met Ala Ala Arg Pro Gly Leu Leu 
1 5 

Val Leu Gly Gly Gly His Leu Ser 
20 

Arg Arg Leu Gly Val Arg Glu Pro 
35 40 

Glu Val Leu Gly Leu Ala Gly Arg 
50 55 

Ala Ala Gin Gin Pro Ala Ser Ala 
65 70 

Arg Ala Met Thr Asp Asp Ser Gly 
85 

Asp Arg Ala Asp Leu lie Met Ser 
100 

Arg Thr Leu Gly Tyr Gin Glu Pro 
115 120 

Leu Thr Gin lie Pro Ala Gly Glu 
130 135 

lie Tyr Lys Glu Pro Ser Thr His 
145 150 

Ser Met Phe Glu Val Val Gin Glu 
165 

Phe Phe Leu Asp Leu Gin Thr Leu 



Trp Leu Leu Gly Leu Ala Leu Cys 
10 15 

His Pro Pro His Val Phe Pro Gin 
25 30 

Arg Asp Met Gin Arg Glu lie Arg 
45 

Pro Arg Ser Arg Ala Pro Val Gly 
60 

Pro Leu Phe Met Leu Asp Leu Tyr 
75 00 

Gly Gly Thr Pro Gin Pro His Leu 
90 95 

Phe Val Asn lie Val Glu Arg Asp 
105 HO 

His Trp Lys Glu Phe His Phe Asp 
125 

Ala Val Thr Ala Ala Glu Phe Arg 
140 

Pro Leu Asn Thr Thr Leu His lie 
155 160 

His Ser Asn Arg Glu Ser Asp Leu 
170 175 

Arg Ser Gly Asp Glu Gly Trp Leu 



40 



45 



50 



55 



64 
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180 



5 


Val Leu 


Asp 

195 


lie 


Thr 


Ala 




Lys Asp 
210 


Leu 


Gly Leu 


Arg 


10 


lie Asp 
225 


Pro 


Gly 


Leu 


Ala 

230 




Arg Gin 


Pro 


Phe 


Met 

245 


Val 


15 


Arg Ala 


Pro Arg 
260 


Thr 


Ala 


20 


lie Asn 


Gin 

275 


Leu 


Pro 


His 




Gly His 
290 


Gly Ser 


His 


Gly 


25 


Val Ser 
305 


Phe 


Arg 


Asp 


Leu 

310 




Gly Tyr 


Ser 


Ala 


Tyr Tyr 
325 


30 


Ser Cys 


Met 


Asn 

340 


Ser 


Thr 




Leu Met 


Lys 

355 


Pro 


Asp 


lie 


35 


Leu Ser 
370 


Ala 


lie 


Ser 


Leu 




Leu Arg 
385 


Arg 


Glu 


Arg 


Asn 

390 



40 



(2) INFORMATION FOR SEQ ID NO:27: 

(j) SEQUENCE CHARACTERISTICS: 

45 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

50 

(ii) MOLECULE TYPE: protein 
(ix) FEATURE: 

55 (A) NAME/KEY: Protein 

(B) LOCATION: 1..104 
(D) OTHER INFORMATION: /note= 



185 190 

Ala Ser Asp Arg Trp Leu Leu Asn His His 
* 200 205 

Leu Tyr Val Glu Thr Glu Asp Gly His Ser 
215 220 

Gly Leu Leu Gly Arg Gin Ala Pro Arg Ser 
235 240 

Gly Phe Phe Arg Ala Asn Gin Ser Pro Val 
250 255 

Arg Pro Leu Lys Lys Lys Gin Leu Asn Gin 
265 270 

Ser Asn Lys His Leu Gly lie Leu Asp Asp 
280 285 

Arg Glu Val Cys Arg Arg His Glu Leu Tyr 
295 300 

Gly Trp Leu Asp Ser Val lie Ala Pro Gin 
315 320 

Cys Ala Gly Glu Cys lie Tyr Pro Leu Asn 
330 335 

Asn His Ala Thr Met Gin Ala Leu Val His. 
345 350 

He Pro Lys Val Cys Cys Val Pro Thr Glu 
360 365 

Leu Tyr Tyr Asp Arg Asn Asn Asn Val lie 
375 380 

Met Val Val Gin Ala Cys Gly Cys His 
395 



l BMP3" 



65 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27; 



5 


Cys Ala 


i Arg 


Arg 


Tyr 


Leu 


Lys 


Val 


Asp 


Phe 


Ala 


Asp 


lie 


Gly 


Trp 


Ser 




1 








5 










10 










15 




10 


Glu 


Trp 


lie 


lie 

20 


Ser 


Pro 


Lys 


Ser 


Phe 

25 


Asp Ala 


Tyr 


Tyr Cys 
30 


Ser Gly 




Ala 


Cys 


Gin 

35 


Phe 


Pro 


Met 


Pro 


Lys 

40 


Ser 


Leu 


Lys 


Pro 


Ser 

45 


Asn 


His 


Ala 


15 


Thr 


He 

50 


Gin 


Ser 


lie 


Val 


Ala 

55 


Arg 


Ala 


Val Gly Val 
60 


Val 


Pro Gly 


lie 




Pro 

65 


Glu 


Pro 


Cys 


Cys 


Val 

70 


Pro 


Glu Lys 


Met 


Ser 

75 


Ser 


Leu 


Ser 


lie 


Leu 

80 


20 


Phe 


Phe 


Asp 


Glu 


Asn 

85 


Lys 


Asn 


Val 


Val 


Leu 

90 


Lys Val 


Tyr 


Pro 


Asn 

95 


Met 




Thr 


Val 


Glu 


Ser 

100 


Cys 


Ala 


Cys 


Arg 



















25 

(2) INFORMATION FOR SEQ ID NO:28: 



(i) SEQUENCE CHARACTERISTICS: 

30 (A) LENGTH: 1 02 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 

40 (jx) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1 ..102 

(D) OTHER INFORMATION: /note= M BMP5 ,, 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 



50 



55 



66 
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10 



15 



20 



25 



30 



35 



40 



Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gin 

1 5 10 15 

Asp Trp He lie Ala Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp Gly 

20 25 30 

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
35 40 45 

lie Val Gin Thr Leu Val His Leu Met Phe Pro Asp His Val Pro Lys 
50 55 60 

Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala lie Ser Val Leu Tyr Phe 

65 70 75 80 

Asp Asp Ser Ser Asn Val lie Leu Lys Lys Tyr Arg Asn Met Val Val 

85 90 95 

Arg Ser Cys Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein. 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1.. 102 

(D) OTHER INFORMATION: /note= "BMPe" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 



45 



50 



55 



67 
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Cys Arg 
1 


Lys 


His 


Glu 

5 


Leu 


Tyr 


Val 


Ser 


Phe 

10 


Gin Asp 


Leu Gly Trp Gin 
15 


5 


Asp Trp 


He 


lie 

20 


Ala 


Pro 


Lys 


Gly Tyr 
25 


Ala 


Ala 


Asn Tyr 


Cys Asp Gly 
30 




Glu Cys 


Ser 

35 


Phe 


Pro 


Leu 


Asn 


Ala 

40 


His 


Met 


Asn 


Ala 


Thr 
45 . 


Asn 


His Ala 


10 


lie Val 
50 


Gin 


Thr 


Leu 


Val 


His 

55 


Leu 


Met 


Asn 


Pro 


Glu 

60 


Tyr 


Val 


Pro Lys 


15 


Pro Cys 
65 


Cys 


Ala 


Pro 


Thr 

70 


Lys 


Leu 


Asn 


Ala 


lie 

75 


Ser 


Val 


Leu 


Tyr Phe 
00 




Asp Asp 


Asn 


Ser 


Asn 

85 


Val 


lie 


Leu 


Lys 


Lys 

90 


Tyr Arg 


Trp 


Met 


Val Val 
95 


20 


Arg Ala 


Cys 


Gly 

100 


Cys 


His 





















(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

25 

(A) LENGTH: 1247 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 

35 (A) ORGANISM: HOMO SAPIENS 

(F) TISSUE TYPE: BRAIN 

(ix) FEATURE: 

40 (A) NAME/KEY: CDS 

(B) LOCATION: 84.. 11 99 

(D) OTHER INFORMATION: /product^ “GDF-1" 

/note= "GDF-1 CDNA" 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 



50 



55 



68 
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GGGGACACCG GCCCCGCCCT CAGCCCACTG GTCCCGGGCC 


GCCGCGGACC CTGCGCACTC 


60 




TCTGGTCATC GCCTGGGAGG AAG ATG CCA CCG CCG CAG CAA GGT CCC TGC 


110 


5 












Met Pro Pro Pro Gin Gin Gly Pro Cys 
1 5 




GGC 


CAC 


CAC 


CTC 


CTC 


CTC CTC 


CTG 


GCC 


CTG 


CTG 


CTG 


CCC 


TCG 


CTG 


CCC 


158 




Gly 


His 


His 


Leu 


Leu 


Leu Leu 


Leu 


Ala 


Leu 


Leu 


Leu 


Pro 


Ser 


Leu 


Pro 


10 


10 










15 








20 










25 






CTG 


ACC 


CGC 


GCC 


CCC 


GTG CCC 


CCA 


GGC 


CCA 


GCC 


GCC 


GCC 


CTG 


CTC 


CAG 


206 




Leu 


Thr 


Arg 


Ala 


Pro 


Val Pro 


Pro 


Gly 


Pro 


Ala 


Ala 


Ala 


Leu 


Leu 


Gin 












30 








35 










40 






15 


GCT 


CTA 


GGA 


CTG 


CGC 


GAT GAG 


CCC 


CAG 


GGT 


GCC 


CCC 


AGG 


CTC 


CGG 


CCG 


254 




Ala 


Leu 


Gly 


Leu 


Arg 


Asp Glu 


Pro 


Gin 


Gly 


Ala 


Pro Arg 


Leu 


Arg 


Pro 










45 








50 










55 








GTT 


CCC 


CCG 


GTC 


ATG 


TGG CGC 


CTG 


TTT 


CGA 


CGC 


CGG 


GAC 


CCC 


CAG 


GAG 


302 


20 


Val 


Pro 


Pro 


Val 


Met 


Trp Arg 


Leu 


Phe 


Arg 


Arg 


Arg Asp 


Pro 


Gin 


Glu 








6 0 








65 










70 












ACC 


AGG 


TCT 


GGC 


TCG 


CGG CGG 


ACG 


TCC 


CCA 


GGG 


GTC 


ACC 


CTG 


CAA 


CCG 


350 




Thr 


Arg 


Ser 


Gly 


Ser 


Arg Arg 


Thr 


Ser 


Pro 


Gly 


Val 


Thr 


Leu 


Gin 


Pro 


25 




75 








80 










85 














TGC 


CAC 


GTG 


GAG 


GAG 


CTG GGG 


GTC 


GCC 


GGA 


AAC 


ATC 


GTG 


CGC 


CAC 


ATC 


398 




Cys 


His 


Val 


Glu 


Glu 


Leu Gly 


Val 


Ala 


Gly 


Asn 


lie 


Val 


Arg 


His 


lie 




90 










95 








100 








105 




30 


CCG 


GAC 


CGC 


GGT 


GCG 


CCC ACC 


CGG 


GCC 


TCG 


GAG 


CCT 


GTC 


TCG 


GCC 


GCG 


446 




Pro 


Asp 


Arg 


Gly Ala 


Pro Thr 


Arg 


Ala 


Ser 


Glu 


Pro 


Val 


Ser 


Ala 


Ala 














110 








115 










120 








GGG 


CAT 


TGC 


CCT 


GAG 


TGG ACA 


GTC 


GTC 


TTC 


GAC 


CTG 


TCG 


GCT 


GTG 


GAA 


494 


35 


Gly 


His 


Cys 


Pro 


Glu 


Trp Thr 


Val 


Val 


Phe 


Asp 


Leu 


Ser 


Ala 


Val 


Glu 












125 








130 










135 










CCC 


GCT 


GAG 


CGC 


CCG 


AGC CGG 


GCC 


CGC 


CTG 


GAG 


CTG 


CGT 


TTC 


GCG 


GCG 


542 




Pro 


Ala 


Glu 


Arg 


Pro 


Ser Arg 


Ala 


Arg 


Leu 


Glu 


Leu 


Arg 


Phe 


Ala 


Ala 


40 






140 








145 










150 












GCG 


GCG 


GCG 


GCA 


GCC 


CCG GAG 


GGC 


GGC 


TGG 


GAG 


CTG 


AGC 


GTG 


GCG 


CAA 


590 




Ala 


Ala 


Ala 


Ala 


Ala 


Pro Glu 


Gly 


Gly 


Trp 


Glu 


Leu 


Ser 


Val 


Ala 


Gin 






155 








160 










165 












45 


GCG 


GGC 


CAG 


GGC 


GCG 


GGC GCG 


GAC 


CCC 


GGG 


CCG 


GTG 


CTG 


CTC 


CGC 


CAG 


638 




Ala 


Gly 


Gin 


Gly Ala 


Gly Ala 


Asp 


Pro 


Gly 


Pro 


Val 


Leu 


Leu 


Arg 


Gin 




170 










175 








180 








185 






TTG 


GTG 


CCC 


GCC 


CTG 


GGG CCG 


CCA 


GTG 


CGC 


GCG 


GAG 


CTG 


CTG 


GGC 


GCC 


686 


50 


Leu 


Val 


Pro 


Ala 


Leu 


Gly Pro 


Pro 


Val 


Arg 


Ala 


Glu 


Leu 


Leu 


Gly Ala 














190 








195 










200 








GCT 


TGG 


GCT 


CGC 


AAC 


GCC TCA 


TGG 


CCG 


CGC 


AGC 


CTC 


CGC 


CTG 


GCG 


CTG 


734 




Ala 


Trp 


Ala 


Arg 


Asn 


Ala Ser 


Trp 


Pro 


Arg 


Ser 


Leu Arg 


Leu 


Ala 


Leu 





55 



69 
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5 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



205 210 215 



GCG 


CTA 


CGC 


CCC 


CGG 


GCC 


CCT 


GCC 


GCC 


TGC 


GCG 


CGC 


CTG 


GCC 


GAG 


GCC 


Ala 


Leu 


Arg 

220 


Pro 


Arg 


Ala 


Pro 


Ala 

225 


Ala 


Cys 


Ala 


Arg 


Leu 

230 


Ala 


Glu 


Ala 


TCG 


CTG 


CTG 


CTG 


GTG 


ACC 


CTC 


GAC 


CCG 


CGC 


CTG 


TGC 


CAC 


CCC 


CTG 


GCC 


Ser 


Leu 

235 


Leu 


Leu 


Val 


Thr 


Leu Asp 
240 


Pro Arg 


Leu 


Cys 

245 


His 


Pro 


Leu 


Ala 


CGG 


CCG 


CGG 


CGC 


GAC 


GCC 


GAA 


CCC 


GTG 


TTG 


GGC 


GGC 


GGC 


CCC 


GGG 


GGC 


Arg 

250 


Pro 


Arg 


Arg 


Asp 


Ala 

255 


Glu 


Pro 


Val 


Leu Gly Gly Gly 
260 


Pro Gly Gly 
265 


GCT 


TGT 


CGC 


GCG 


CGG 


CGG 


CTG 


TAG 


GTG 


AGC 


TTC 


CGC 


GAG 


GTG 


GGC 


TGG 


Ala 


Cys 


Arg 


Ala 


Arg 

270 


Arg 


Leu 


Tyr Val 


Ser 

275 


Phe 


Arg 


Glu 


Val 


Gly Trp 
280 


CAC 


CGC 


TGG 


GTC 


ATC 


GCG 


CCG 


CGC 


GGC 


TTC 


CTG 


GCC 


AAC 


TAC 


TGC 


CAG 


His 


Arg 


Trp 


Val 

285 


He 


Ala 


Pro 


Arg 


Gly 

290 


Phe 


Leu 


Ala 


Asn 


Tyr 

295 


Cys 


Gin 


GGT 


CAG 


TGC 


GCG 


CTG 


CCC 


GTC 


GCG 


CTG 


TCG 


GGG 


TCC 


GGG 


GGG 


CCG 


CCG 


Gly Gin 


Cys 

300 


Ala 


Leu 


Pro 


Val 


Ala 

305 


Leu 


Ser 


Gly Ser 


Gly Gly 
310 


Pro 


Pro 


GCG 


CTC 


AAC 


CAC 


GCT 


GTG 


CTG 


CGC 


GCG 


CTC 


ATG 


CAC 


GCG 


GCC 


GCC 


CCG 


Ala 


Leu 

315 


Asn 


His 


Ala 


Val 


Leu 

320 


Arg 


Ala 


Leu 


Met 


His 

325 


Ala 


Ala 


Ala 


Pro 


GGA 


GCC 


GCC 


GAC 


CTG 


CCC 


TGC 


TGC 


GTG 


CCC 


GCG 


CGC 


CTG 


TCG 


CCC 


ATC 


Gly Ala 
330 


Ala 


Asp 


Leu 


Pro 

335 


Cys 


Cys 


Val 


Pro 


Ala 

340 


Arg 


Leu 


Ser 


Pro 


lie 

345 


TCC 


GTG 


CTC 


TTC 


TTT 


GAC 


AAC 


AGC 


GAC 


AAC 


GTG 


GTG 


CTG 


CGG 


CAG 


TAT 


Ser 


Val 


Leu 


Phe 


Phe 

350 


Asp 


Asn 


Ser 


Asp Asn 
355 


Val 


Val 


Leu 


Arg 


Gin 

360 


Tyr 


GAG GAC 
Glu Asp 


ATG 

Met 


GTG 

Val 

365 


GTG 

Val 


GAC 

Asp 


GAG 

Glu 


TGC 

Cys 


GGC TGC 
Gly Cys 
370 


CGC 

Arg 


TAACCCGGGG CGGGCAGGGA 



CCCGGGCCCA ACAATAAATG CCGCGTGG 



(2) INFORMATION FOR SEQ ID NO:31 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 



782 



830 



878 



926 



974 



1022 



1070 



1118 



1166 



1219 



1247 



70 
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Met Pro Pro Pro Gin Gin Gly Pro Cys Gly His His Leu Leu Leu Leu 
1 5 10 15 

5 Leu Ala Leu Leu Leu Pro Ser Leu Pro Leu Thr Arg Ala Pro Val Pro 

20 . 25 30 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 



71 
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5 



10 



15 . 



20 



25 



30 



35 



40 



45 



50 



55 



Pro Gly Pro Ala Ala Ala Leu Leu Gin Ala Leu Gly Leu Arg Asp Glu 
35 40 45 

Pro Gin Gly Ala Pro Arg Leu Arg Pro Val Pro Pro Val Met Trp Arg 
50 55 60 

Leu Phe Arg Arg Arg Asp Pro Gin Glu Thr Arg Ser Gly Ser Arg Arg 
65 70 75 80 

Thr Ser Pro Gly Val Thr Leu Gin Pro Cys His Val Glu Glu Leu Gly 
85 90 95 

Val Ala Gly Asn He Val Arg His lie Pro Asp Arg Gly Ala Pro Thr 
100 105 110 

Arg Ala Ser Glu Pro Val Ser Ala Ala Gly His Cys Pro Glu Trp Thr 
115 120 125 

Val Val Phe Asp Leu Ser Ala Val Glu Pro Ala Glu Arg Pro Ser Arg 
130 135 140 

Ala Arg Leu Glu Leu Arg Phe Ala Ala Ala Ala Ala Ala Ala Pro Glu 
145 150 155 160 

Gly Gly Trp Glu Leu Ser Val Ala Gin Ala Gly Gin Gly Ala Gly Ala 
165 170 175 

Asp Pro Gly Pro Val Leu Leu Arg Gin Leu Val Pro Ala Leu Gly Pro 
180 185 190 

Pro Val Arg Ala Glu Leu Leu Gly Ala Ala Trp Ala Arg Asn Ala Ser 
195 200 205 

Trp Pro Arg Ser Leu Arg Leu Ala Leu Ala Leu Arg Pro Arg Ala Pro 
210 215 220 

Ala Ala Cys Ala Arg Leu Ala Glu Ala Ser Leu Leu Leu Val Thr Leu 

225 230 235 240 

Asp Pro Arg Leu Cys His Pro Leu Ala Arg Pro Arg Arg Asp Ala Glu 

245 250 255 

Pro Val Leu Gly Gly Gly Pro Gly Gly Ala Cys Arg Ala Arg Arg Leu 
260 265 270 

Tyr Val Ser Phe Arg Glu Val Gly Trp His Arg Trp Val lie Ala Pro 
275 280 285 

Arg Gly Phe Leu Ala Asn Tyr Cys Gin Gly Gin Cys Ala Leu Pro Val 
290 295 300 

Ala Leu Ser Gly Ser Gly Gly Pro Pro Ala Leu Asn His Ala Val Leu 

305 310 315 320 

Arg Ala Leu Met His Ala Ala Ala Pro Gly Ala Ala Asp Leu Pro Cys 

325 330 335 

Cys Val Pro Ala Arg Leu Ser Pro lie Ser Val Leu Phe Phe Asp Asn 
340 345 350 

Ser Asp Asn Val Val Leu Arg Gin Tyr Glu Asp Met Val Val Asp Glu 
355 360 365 
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Cys Gly Cys Arg 
370 



5 

(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

io (A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

15 . (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 



Arg Xaa Xaa Arg 



Claims 

25 i. Use of a morphogen for the manufacture of a medicament for use in: 

(a) stimulating morphogenesis of dentine in a mammalian tooth; or 

(b) stimulating phenotypic expression of mammalian odontoblasts; or 

(c) stimulating production of dentine matrix by mammalian odontoblasts; or 

30 (d) increasing thickness of a mammalian tooth wall; or 

(e) reducing risk of fracture in a mammalian tooth; or 

(f) desensitizing a mammalian tooth to perception of pressure or temperature; or 

(g) sealing a cavity in a mammalian tooth; wherein said medicament is applyied to a dentinal surface. 

35 2. The use of claim 1 wherein the dentinal surface either: (i) adjoins a site of lost or damaged enamel, dentine or 

cementum tissue, such as a cavity, of said tooth, or (ii) adjoins a site of lost or damaged gingival tissue. 

3. The use of claim 1 or claim 2 wherein the dentinal surface has been treated either to:- (i) ablate damaged or 
infected enamel, dentine or cementum tissue from the site of said cavity, or (ii) debride damaged gingival, enamel, 
40 dentine or cementum tissue from said dentinal surface. 



4. The use of any one of claims 1 to 3 comprising the application of said morphogen in an amount effective for 
stimulating formation of reparative dentine apposite said dentinal surface. 

45 5 . The use of claim 4 wherein said dentinal surface is transverse to luminae of dental canaliculi within said tooth. 

6. The use of any one of the preceding claims wherein said dentinal surface is separated from the pulp chamber wall 
of said tooth by up to about 1 mm of residual dentine. 

50 7 . The use of any one of the preceding claims wherein: 



(A) said morphogen is solubilised in a physiologically acceptable vehicle or an evaporative vehicle; or 

(B) said morphogen is adsorbed on a biocompatible, acellular matrix suitable for sealing or filling defects in 
mammalian teeth. 

55. 

8. The use of claim 7(A) wherein the morphogen is solubilized by the preparative step of solubilising said morphogen 
in a physiologically acceptable vehicle or an evaporative vehicle, said vehicle optionally further comprising a co- 
factor that mitigates symptoms associated with tooth damage. 
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9. The use of claim 7(B) wherein the morphogen is adsorbed by the preparative step of adsorbing said morphogen 
on a biocompatible, acellular matrix suitable for sealing or filling defects in mammalian teeth, said matrix optionally 
further comprising a cofactor that mitigates symptoms associated with tooth damage. 

10. The use of any one of the preceding claims wherein said morphogen comprises a dimeric protein that induces 
morphogenesis of mammalian dentine tissue, said dimeric protein comprising a pair of folded polypeptides, the 

. amino acid sequence of each of which comprises 

(i) a sequence sharing at least 70% homology with the C-terminal seven cysteine domain of human OP1, 
residues 38-1 39 of Seq. ID No. 4; 

(ii) a sequence encoded by a nucleic acid that hybridizes under stringent conditions with nucleic acid encoding 
said domain of human OP1 ; or 

(iii) a sequence defined by Generic Sequence 8, Seq. ID No. 2. 

11. The use of claim 10 wherein: (i) said sequence of said morphogen polypeptides is defined by OPX, Seq. ID No. 
3 and/or (ii) the morphogen is obtained from culture medium of morphogen-secreting mammalian cells. 

12. The use of claim 10 or 11 wherein said sequence of said morphogen polypeptides is selected independently in 
each said polypeptide from the sequences of the C-terminal seven cysteine domains of human OP1 , mouse OP1 , 
human OP2, mouse OP2, mouse OP3, Drosophila 60A protein, Xenopus Vgl, mouse Vgr-T, mouse GDF-1, Dro- 
sophila DPP, CBMP2A, CBMP2B, BMP3, BMPS, BMP6 (shown in Seq. ID Nos. 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 24, 
26, 27, 28 and 29), and allelic, phylogenetic and biosynthetic variants thereof. 

1 3. The use of claim 1 0 or 1 1 wherein said sequence of said morphogen polypeptides is selected, in each said polypep- 
tide, from the sequences of the C-terminal seven cysteine domains of human OP-1, human OP-2, mouse OP-1, 
mouse OP-2, mouse OP-3 and Drosophila 60A protein (shown in seq. ID Nos. 4, 5, 6, 7, 24 and 26), and allelic, 
phylogenetic and biosynthetic variants thereof. 

14. The use of claim 10 or 11 wherein said morphogen is solubilised by association with at least one morphogen 
prodomain polypeptide or solubility-enhancing fragment thereof. 



Patentanspruche 

1. Verwendung eines Morphogens zur Herstellung eines Medikamentes zur Verwendung bei: 

(a) Stimulieren der Dentinmorphogenese in einem Saugetierzahn; Oder 

(b) Stimulieren der phanotypischen Expression von Saugetier- Odontoblasten; Oder 

(c) Stimulieren der Erzeugung von Dentinmatrix durch Saugetier- Odontoblasten; Oder 

(d) Erhbhen der Dicke eines Saugetierzahnes; Oder 

(e) Vermindern des Bruchrisikos bei einem Saugetierzahn; Oder 

(f) Desensibilisieren eines Saugetierzahns gegenuber dem Wahrnehmungsvermogen von Druck Oder Tem- 

peratur; Oder 

(g) Versiegeln einer Hohle in einem Saugetierzahn; wobei das Medikament auf eine Dentinal-Oberflache auf- 

gebracht wird. 

2. Verwendung nach Anspruch 1 , bei der die Dentinal-Oberflache entweder: (i) an eine Stelle verlorenen Oder be- 
schadigten Zahnschmelzes, Dentin- Oder Zementgewebes, wie beispielsweise einer Hdhle dieses Zahnes, an- 
grenzt Oder (ii) an eine Stelle eines verlorenen Oder beschadigten Zahnfleisch-Gewebes angrenzt. 

3. Verwendung nach Anspruch 1 Oder Anspruch 2, bei der die Dentinal-Oberflache entweder zum: (i) Abtragen von 
beschadigtem Oder infiziertem Zahnschmelz, Dentin- Oder Zement gewebe von der Stelle dieser Kavitat Oder zum 
(ii) Reinigen von geschadigtem Zahnfleisch-, Zahnschmelz-, Dentin- Oder Zementgewebe von dieser Dentinal- 
Oberflache behandelt wird. 

4. Verwendung nach einem der Anspruche 1-3, die die Anwendung des Morphogens in einer zur Stimulierung der 
Bildung von Reparaturdentin wirksamen Menge umfaBt, die der Dentinal-Oberflache angemessen ist. 
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5. Verwendung nach Anspruch 4, bei der die Dentinal-Oberflache quer zu Hohlraumen von Dentalkanalchen inner- 
halb des Zahns verlauft. 

6. Verwendung nach einem der vorhergehenden Anspruche, bei der die Dentinal- Oberflache von der Pulpakammer- 
wand des Zahns von bis zu ungefahr 1 Millimeter Restdentin getrennt ist. 

7. Verwendung nach einem der vorhergehenden Anspruche, bei der: 

(A) Das Morphogen in einem physiologisch akzeptablen Trager Oder einem VerdunstungstrSger solubilisiert 
ist; Oder 

(B) das Morphogen an eine biokompatible, azellulare Matrix adsorbiert ist, die zum Versiegeln Oder Fullen von 
Defekten in Saugetierzahnen geeignet ist. 

8. Verwendung nach Anspruch 7A, bei der das Morphogen durch den Vorbereitungsschritt, das Morphogen in einem 
physiologisch akzeptablen Trager Oder in einem Verdunstungstrager zu solubilisieren, solubilisiert wird, wobei der 
Trager optional weiterhin einen Co-Faktor umfaBt der mit Zahnbeschadigung verbundene Symptome lindert. 

9. Verwendung nach Anspruch 7B, bei der das Morphogen durch den Vorbereitungsschritt des Adsorbierens des 
Morphogens an eine biokompatible, azellulare Matrix adsorbiert wird, die zum Versiegeln und Fullen von Defekten 
in Saugetierzahnen geeignet ist, wobei die Matrix optional weiterhin einen Co-Faktor umfaBt, der mit einer Zahn- 
beschadigung verbundene Symptome lindert. 

10 . Verwendung nach einem der vorhergehenden Anspruche, bei der das Morphogen ein dimeres Protein umfaBt, 
das die Morphogenese von Saugetier-Dentingewebe induziert, wobei das dimere Protein zwei gefaltete Polypep- 
tide umfaBt, deren Aminos&uresequenz jeweils 

(i) eine Sequenz, die zumindest 70% Homologie mit der C-terminalen 7-Cystein-Domane von menschlichem 
OPt, Reste 38 bis 139 von Sequenz ID Nr. 4, teilt; 

(ii) eine Sequenz, die von einer Nukleinsaure kodiert wird, die unter stringenten Bedingungen mit einer Nukle- 
insaure hybridisiert, die diese Domane von menschlichem OP1 kodiert; Oder 

(iii) eine Sequenz umfaBt, die durch Gattungssequenz 8, Sequenz ID-Nr. 2 definiert ist. 

11 . Verwendung nach Anspruch 10, bei der: (i) die Sequenz dieser Morphogen- Polypeptide durch OPX, Sequenz ID- 
Nr. 3 definiert ist und/oder (ii) das Morphogen aus Kulturmedium Morphogen-sezernierender Saugetierzellen ge- 
wonnen wird. 

12 . Verwendung nach Anspruch 10 Oder 11 , bei der die Sequenz des Morphogen- Polypeptids unabhSngig bei jedem 
dieser Polypeptide aus den Sequenzen der C-Terminalen 7-Cystein-Domanen von menschlichem OP1, Maus- 
OP1 , menschlichem OP2, Maus-OP2, Maus-OP3, Drosophila 60A-Protein, Xenopus Vgl, Maus Vgr-1 , Maus-GDF- 
1, Drosophila-DPP, CBMP2A, CBMP2B, BMP3, BMP5, BMP6 (dargestellt in den Sequenz ID-Nr. 4, 5, 6, 7, 8, 9, 
1 0, 1 1 , 1 2, 1 3, 24, 26, 27, 28 und 29) und Allel-, phylogenetische und biosynthetische Varianten hiervon, ausgewahlt 
ist. 

13 . Verwendung nach Anspruch 10 Oder 11, bei der die Sequenz des Morphogen- Polypeptids bei jedem Polypeptid 
aus den Sequenzen der C-Terminalen 7-Zystein-Domanen von menschlichem OP1, menschlichem OP2, Maus- 
OP1, Maus-OP2, Maus-OP3 und Drosophila 60A-Protein (dargestellt in den Sequenz ID-Nr. 4, 5, 6, 7, 24 und 26) 
und Allel-, phylogenetischen und biosynthetischen Varianten hiervon, ausgewahlt ist. 

14 . Verwendung nach Anspruch 10 Oder 11 , bei der das Morphogen durch Verbindung mit zumindest einem Morpho- 
gen-Prodoman-Polypeptid Oder Loslichkeits-verbessernden Bruchstucks hiervon solubilisiert wird. 



Revendications 

1. Utilisation d'un morphog&ne pour la fabrication d’un medicament pour une utilisation : 
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(a) pour la stimulation de la morphogenfcse de la dentine de la dent d'un mammifere; ou 

(b) pour la stimulation de I'expression phenotypique des odontoblastes de mammiferes; ou 

(c) pour la stimulation de la production de la matrice de la dentine par des odontoblastes de mammiferes; ou 

(d) pour I'augmentation de l’6paisseur de la paroi d’une dent de mammifere; ou 

(e) pour la reduction du risque de fracture d'une dent de mammifere; ou 

(f) pour la desensibilisation de la dent d'un mammifere k la perception d'une pression ou d’une temperature; ou 

(g) pour le scellage d'une cavity d'une dent de mammifere; 

au cours de laquetle le dit medicament est applique sur une surface dentinale. 

.2. Utilisation selon la revendication 1 au cours de laquelle la surface dentinale soit: (i) est contigue k un site de perte 
ou d'endommagement de I'email, de la dentine ou d'un tissu du cement, tel qu'une cavite de cette dent soit (ji) est 
contigue k un site de perte ou d’endommagement d'un tissu gingival. 

3. Utilisation selon la revendication 1 ou 2 au cours de laquelle la surface dentinale a ete traitee soit pour:- (i) pratiquer 
I'ablation d'un email, d’une dentine, ou d'un tissu du cement endommage ou .infecte k partir du site de cette cavite 
soit (ii) debrider 1 'email, la dentine ou le tissu du cement gingival endommage k partir de cette surface dentinale. 

4. Utilisation selon I'une quelconque des revendication 1 k 3 comprenant I'application de ce morphogene en une 
quantite efficace pour stimuler la formation de la dentine reparatrice imposee a cette surface dentinale. 

5. Utilisation selon la revendication 4 au cours de laquelle cette surface dentinale est transversale par rapport aux 
lumferes des canalicules dentaires k I'interieur de cette dent. 

6. Utilisation selon I'une quelconque des revendications pr^cedentes au cours de laquelle cette surface dentinaire 
est s§par6e de la paroi de la chambre de la pulpe de cette dent par jusqu'e environ 1 mm de dentine residuelle. 

7. Utilisation selon I'une quelconque des revendications pr6c6dentes au cours de laquelle: 

(A) le dit morphogene est solubilise dans un v£hicule physiologiquement acceptable ou un v£hicule evapora- 
teur; ou 

(B) le dit morphogene est adsorbe sur une matrice acellulaire, biocompatible, appropriee pour fermer ou com- 
bler des defauts dans les dents de mammiferes. 

8. Utilisation selon la revendication 7 (A) au cours de laquelle le morphogene est solubilise par retape preparatoire 
de solubilisation de ce morphogene dans un v6hicule physiologiquement acceptable ou un v6hicule evaporateur, 
ce vehicule comprenant en outre facultativement un cofacteur qui mitige les symptdmes associes k un dommage 
d'une dent. 

9. Utilisation selon la revendication 7 (B) au cours de laquelle le morphogene est adsorbe par I'etape preparatoire 
d'adsorption de ce morphogene sur une matrice acellulaire biocompatible, appropri6e pour fermer ou combler des 
defauts dans les dents des mammiferes, cette matrice comprenant en outre un cofacteur qui mitige les symptdmes 
associds k un dommage d'une dent. 

10. Utilisation selon I'une quelconque des revendications precddentes au cours de laquelle ce morphogene comprend 
un proteine dirndre qui induit la morphogenese d’un tissu de la dentine d'un mammifere, cette protdine dimdre 
contenant une paire de polypeptides replies, dont la sequence d'aminoacides de chacune d'entr'elles comprend 

(i) une sequence partageant au moins 70 % d'homologie avec le domaine C-terminal k sept cysteines de 
TOPI humaine, les r6sidus 38-139 de la sequence ID No. 4; 

(ii) une sequence codee par un acide nucieique qui s'hybride sous des conditions stringentes avec un acide 
nucieique codant pour ce domaine de I'OPI humaine; ou 

(iii) une sequence definie par la sequence g6n6rique 8, Seq. ID No. 2. 

11. Utilisation selon la revendication 10 au cours de laquelle: (i) cette sequence de ces polypeptides morphogenes 
est definie par OPX, Seq. ID No. 3 et/ou (ii) on obtient le morphogene k partir d'un milieu de culture de cellules 
de mammifere s6cretant un morphogene. 
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12. Utilisation selon la revendication 10 ou 11 au cours de laquelle cette sequence de ces polypeptides morphog&nes 
est choisie ind^pendamment dans chacun de ces polypeptides provenant des sequences des domaines C-termi- 
naux k sept cysteines de I'OPI humaine, OP1 de souris, OP2 humaine, OP2 de souris, OP3 de souris, proteine 
60A de la drosophile, Vgl de xenope, Vgr-1 de souris, GDF-1 de souris, DPP de drosophile, CBMP2A, CBMP2B, 
BMP3, BMP5, BMP6 (months dans les Seq. ID No. 4, 5, 6, 7, 8, 9, 1 0, 1 1 , 1 2, 1 3, 24, 26, 27, 28 et 29), des variants 
allies, phylog6n6tiques et biosyntttetiques de ceux-ci. 

13. Utilisation selon la revendication 10 ou 11 au cours de laquelle cette sequence de ces polypeptides morphogfenes 
est choisie, dans chacun de ces polypeptides, k partir des sequences des domaines C-terminaux k sept cysteines 
de I'OPI humaine, OP2 humaine, OP1 de souris, OP2 de souris, OP3 de souris et de la proteine 60A de la dro- 
sophile (monttes dans les Seq. ID No. 4, 5, 6, 7, 24 et 26), et les variants alteles, phylog6n6tiques et biosyntttetiques 
de ceux-ci. 

14. Utilisation selon la revendication 10 ou 11 dans laquelle le dit morphog&ne est solubilise par association avec au 
moins un polypeptide du prodomaine du morphog^ne ou du fragment d'activation de la solubility de celui*ci. 
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